In accordance with the present invention, it has been
discovered that nuclear receptor proteins isolated from the
silk moth Bombyx mori (bR) are useful for the regulation
of transgene expression. bR is generally thought to be
a strong transcriptional regulator within cells of the silk
moth. In accordance with the present invention, it has
been discovered that bR is also functional in mammalian
cells. It has further been discovered that the addition
of activation domains to the bR open-reading frame
enhances the activity of the ligand-modulated regulator
to afford high-level transcriptional induction. Further
modifications to the bR ligand binding domain result in
receptors with unique transactivation characteristics.

Modified Lepidopteran Receptors and Hybrid
Multifunctional Proteins for Use in Regulation of
Transgene Expression

ACKNOWLEDGMENT

This invention was made with Government support under
Grant No. AG 10435, awarded by the National Institutes of
Health. The Government has certain rights in the
invention.

RELATED APPLICATIONS 

This application claims priority from Provisional
Application, U.S. S.N. 60/ , filed July 1 , 1998, and
also claims priority from United States Serial No.
08/891,298, filed July 10, 1997, now pending, the entire
contents of both of which are hereby incorporated by
reference herein.

FIELD OF THE INVENTION 

The present invention relates to methods in the field
of recombinant DNA technology, and products related
thereto. In a particular aspect, the invention relates to
methods for modulating the expression of exogenous genes in
mammalian systems, and products useful therefor.

BACKGROUND OF THE INVENTION

In the field of genetic engineering, precise control
of gene expression is an invaluable tool for studying,
manipulating and controlling development and other
physiological processes. For example, applications for
regulated gene expression in mammalian systems include
inducible gene targeting, overexpression of toxic and
teratogenic genes, anti-sense RNA expression, and gene
therapy (see, for example, Jaenisch, R. (1988) Science 240,
1468-1474). For cultured cells, glucocorticoids and other
steroids have been used to induce the expression of a
desired gene.

As another means for controlling gene expression in
mammalian systems, an inducible tetracycline regulated
system has been devised and utilized in transgenic mice,
whereby gene activity is induced in the absence of the
antibiotic and repressed in its presence (see, e.g., Gossen
et al. (1992) PNAS 89, 5547-5551; Gossen et al. (1993) TIBS
18, 471-475; Furth et al. (1994) PNAS 91, 9302-9306; and
Shockett et al. (1995) PNAS 92, 6522-6526). However,
disadvantages of the inducible tetracycline system include
the requirement for continuous administration of
tetracycline to repress expression and the slow clearance
of antibiotic from bone, which interferes with regulation
of gene expression. While this system has been improved by
the recent identification of a mutant tetracycline
repressor which acts conversely as an inducible activator,
the pharmacokinetics of tetracycline may hinder its use
during development when a precise and efficient "on-off"
switch is essential (see, e.g., Gossen et al. (1995)
Science 268, 1766-1769).

Historically, the expression of transgenes with
potential pathogenic and cytotoxic properties has been
difficult in the context of stable cell lines since cells
chronically producing significant levels of the toxic
transgene generally die from the cytopathic effects. For
example, CAG repeats and concomitant polyglutamine (polyQ)
expression have been linked to a variety of
neurodegenerative conditions, including Huntington's
disease (Koide et al. (1994) Nat Genet 6:9-13),
dentatorubro-pallidoluysian atrophy (Ranum et al. (1994)
Nat Genet 8:280-284), spinocerebellar ataxias (Kawaguchi et
al. (1994) Nat Genet 8:221-228, Imbert et al. (1993) Nat
Genet 4:72-76, Sanpei et al. (1996) Nat Genet 14:277-284,
and Zhuchenko et al. (1997) Nat Genet 15:62-69), and
spinobulbar muscular atrophy (David et al. (1997) Nat Genet
17:65-70). PolyQ expansion has also been linked to the
formation of intracellular protein aggregates in vivo in
transgenic mice expressing human huntingtin fragments with
expanded repeats (Ordway et al. (1997) Cell 91:753-763),
brain tissue from Huntington's patients (DiFiglia et al.
(1997) Science 277:1990-1993) and transiently transfected
cultured cells (Martindale et al. (1998) Nat Genet 18:150-154
and Cooper et al. (1998) Hum Mol Genet 7:763-790).

In transient transfection models, a 79-Q expanded
repeat from the ataxin 3 gene has been shown to form a
"punctate pattern" within the cytoplasm, and to result in
the apoptotic cell death of 70-80% of transfected COS-7
cells within 48 hours of transfection. Paulson et al. have
reported that 293 cells transfected with a separate 78-Q
ataxin-3 derived construct acquired intranuclear inclusion
bodies and also underwent apoptotic cell death in a subset
(30%) of the transfected population within 72 hours of
transfection. Neuroblastoma cells transiently transfected
with an 82-Q construct derived from an expanded huntingtin
gene exon 1 were found to be 2-3 times more susceptible to
staurosporine induced apoptosis than cells transfected with
a LacZ control construct (18); however, spontaneous cell
death was not described. Variations in culture techniques,
transfection efficiencies, and cell lines, however, make it
difficult to rigorously characterize the progression of
polyQ expression, intracellular aggregate (IA) formation,
and cell death. One impediment to the comparative study of
IAs in vitro is that transient transfection studies predict
that toxicity from polyQ over-expression will prevent the
isolation of stable cultured cells.

Accordingly, there is a need in the art for improved
systems to precisely modulate the expression of exogenous
genes in mammalian subjects. For example, a
non-mammalian-based transcription regulating system would be
desirable for general application to transgene regulation
in in vitro, ex vivo, and in vivo applications, as well as
transgenic animals. Such systems would be extremely
desirable to conditionally express pathogenic or toxic
compounds in order to produce stable cells. A system that
is simple, compact and dependent on ligands which are
relatively inexpensive, readily available and of low
toxicity in animals would prove useful for stimulation of
regulated systems.

BRIEF DESCRIPTION OF THE INVENTION

In accordance with the present invention, it has been
discovered that nuclear receptor proteins isolated from the
silk moth Bombyx mori (bR) are useful for the regulation of
transgene expression. bR is generally thought to be a
strong transcriptional regulator within cells of the silk
moth. In accordance with the present invention, it has
been discovered that bR is also functional in mammalian
cells without the addition of exogenous dimer partners
therefor. It has further been discovered that the addition
of activation domains to the bR open-reading frame (VbR)
enhances the activity of the ligand-modulated regulator to
afford high-level transcriptional induction (see, e.g.,
Figure 1A). Further modifications to the bR ligand binding
domain result in receptors with unique transactivation
characteristics (see, e.g., Figure 1B).

In accordance with another aspect of the present
invention, hybrid proteins produced by fusion of modified
bRs with other ligand-regulated proteins have been found to
be capable of high level, regulated transactivation of
sequences controlled by both hormone response elements and
tetracycline operators. VbR variants and hybrid proteins
(see, e.g., Figure 1C), in combination with the appropriate
promoters and transgenes, can be introduced into target
cells by common methods such as transfection of plasmids or
by virus mediated gene transfer. The small size and
simplicity of these proteins make them particularly
attractive for use in retroviral vectors.

Invention receptor peptides have proven capable of
providing a high degree (>100 fold) of ligand-dependent
transgene expression in a variety of infected and
transfected cell types including primary cell types. In
addition, several transgenes including luciferase, LacZ,
human GH, GFP, tyrosine hydroxylase, and the low-affinity
NGF receptor, have all been shown to function in a
ligand-dependent fashion in cells infected with invention
constructs.
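As a rough illustration of how a fold-induction figure such as the >100 fold value above is commonly derived, the following minimal Python sketch divides reporter activity measured with ligand by the activity measured without ligand. The function name and the reporter readings are hypothetical and are not data from the invention.

```python
def fold_induction(with_ligand: float, without_ligand: float) -> float:
    """Ratio of reporter activity with ligand to activity without ligand."""
    if without_ligand <= 0:
        raise ValueError("basal (no-ligand) activity must be positive")
    return with_ligand / without_ligand


# Hypothetical luciferase readings (arbitrary light units), illustration only.
basal = 120.0
induced = 15600.0
print(fold_induction(induced, basal))  # 130.0
```

A ratio above 100 for such a pair of readings would correspond to the "high degree" of ligand-dependent expression referred to above; normalization to an internal control reporter is omitted here for brevity.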



Invention methods provide for regulated gene
expression by exogenous non-mammalian inducers, and
therefore can be advantageously employed in a variety of in
vivo and in vitro mammalian expression systems. For
example, inducible expression of flp recombinase in
transgenic mammals, in accordance with invention methods,
would enable those of skill in the art to accomplish
temporally specific inducible gene targeting of the adult
or the developing embryo (see, e.g., U.S. Patent Nos.
5,654,182 and 5,677,177). Stable cell lines conditionally
expressing desired genes provide a valuable tool for the
study and treatment of diseases. In order to maximize the
chance of obtaining stably transfected/infected cultured
cells expressing a desired transgene, such as a polyglutamine
expression cassette (PQEC), a retroviral construct is
provided in which expression of the transgene is controlled
by an exogenous ligand.

For example, in order to study possible mechanisms
underlying Huntington's disease, cultured cell types that
conditionally express the glutamine (CAG) repeats
characteristic of the pathological form of the huntingtin
protein are produced. Two aspects of polyQ-mediated
cytotoxicity are well-suited to characterization in vitro:
the expression, distribution, and aggregation of proteins
with expanded polyQ tracts within individual cells, and the
progression of polyQ expression to death of the cell. The
huntingtin protein is expressed predominantly in the
cytoplasm of both neuronal and non-neuronal cell types.
The expression of polyglutamine tracts separately from the
huntingtin protein will allow one to determine what toxic
properties, if any, polyglutamine tracts themselves may
harbor. A variable length polyglutamine fused to cytosolic
reporter proteins such as green fluorescent protein (GFP)
facilitates the selection and characterization of PQEC
ligand-responsive cell populations and clonal lines. The
use of recombinant retroviruses as PQEC transfer vectors
will allow the stable integration of the transgene into a
wide variety of cultured cell types.

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1A provides a schematic drawing of the
nuclear receptor protein isolated from the silk moth
Bombyx mori (bR), as well as a construct containing a
VP16 activation domain fused at an internal MluI site
near the N-terminus of the Bombyx mori nuclear receptor,
referred to as VbR.

Figure 1B depicts the replacement of several amino
acids of the ligand binding domain of the Bombyx mori
nuclear receptor with sequences from the retinoic acid
receptor or the thyroid hormone receptor (VbRRA and
VbRTR, respectively).

Figure 1C provides a schematic of a hybrid
transactivating construct, designated TTMT, containing a
tetracycline-responsive transactivator (Tet-responsive
region), a VP16 activation domain and a peptide derived
from Bombyx mori.

Figure 2A shows the induction of fusion proteins
(VP16 activation domain fused to either Drosophila
ecdysone receptor (dVEcR) or Bombyx mori nuclear receptor
derived (VbR)) with 5 μg/ml tebufenozide in the absence of
exogenous dimer partners.

Figure 2B shows that a peptide derived from Bombyx
mori responds to tebufenozide (teb) with greater than
200-fold induction, while Drosophila EcR responds weakly,
if at all.

Figure 2C illustrates the action of tebufenozide
(teb) on the expression of EcRE-Luc reporter when
co-transfected with the following receptors:
LINX: which encodes the tet-transactivator (TTA),
VbR: which encodes VbR,
TTMT: which encodes the TTA-VbR fusion protein, and
TTMT-2V: similar to TTMT but with two VP16 t-domains.

Figure 2D depicts the same experiment illustrated in
Figure 2C, utilizing a TetO-luc reporter, which responds
only to TTA or fusion proteins in the absence of
doxycycline.

Figure 2E shows the results of an experiment
employing CV-1 cells transfected with TTMT and two
reporters: 6TetO-luc and 4-EcRE-β-galactosidase.

Figure 3 shows that lower levels of teb are as
effective as higher levels for stimulating full activity.

Figure 4 is a schematic drawing of representative 
single-plasmid constructs involving VbR use. 

Figure 5 illustrates the construction of the pBW
plasmid.

Figure 6A provides a comparison of Bombyx mori
ecdysone receptor (BE) and Drosophila melanogaster ecdysone
receptor (DE) by transient transfection with both receptors
and the dimer partners Usp or RXR in 293 cells.

Figure 6B provides a similar comparison as in Figure
6A, however, in CV-1 cells.

Figure 7A is a schematic drawing of native and
chimeric ecdysone receptors.

Figure 7B is a graph depicting the transient
transfection of VEH into CV-1 cells.

Figure 8 is a schematic drawing comparing several
different chimeric receptors.

Figure 9A provides a schematic of receptor, variant
viral vectors and an ecdysone response element responsive
reporter virus (MS).

Figure 9B illustrates induction levels of luciferase
in 293, CV-1 and primary fibroblast cells infected with
MS-luc virus.

Figure 10 illustrates the construction of the Boris I
plasmid.

Figure 11 illustrates the construction of the Boris
III plasmid.

Figure 12 illustrates the construction of the Boris IV
plasmid.

Figure 13 illustrates the construction of the
pBW-6E-GFP-CB-VE plasmid.

Figure 14 illustrates how the retroviral construct BORIS
III transcribes a transgene in the presence and absence of
a ligand.

Figures 15A and B illustrate the construction of a
PolyQ expression construct.

Figure 16 provides a comparison of various polyQ-GFP
vectors constructed.



DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, there are
provided nucleic acid constructs encoding a receptor
peptide comprising:

(i) a ligand binding domain and hinge region of
a non-mammalian member of the nuclear
receptor superfamily,

(ii) a DNA binding domain, and

(iii) an activation domain,

wherein the receptor peptide activates a regulatory element
in the absence of an exogenous dimer partner therefor and
in the presence of a ligand for said ligand binding domain.
Alternatively, there are provided nucleic acid constructs
encoding a receptor peptide comprising:

(i) a ligand binding domain and hinge region of a
non-mammalian member of the nuclear receptor
superfamily, and

(ii) a DNA binding domain, wherein the DNA
binding domain is not obtained from the same
member as the ligand binding domain and
hinge region,

wherein the receptor peptide activates a regulatory
element in the absence of an exogenous dimer partner
therefor and in the presence of a ligand for said ligand
binding domain.
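The two alternative domain compositions recited above can be pictured with a small data model. The following Python sketch is purely illustrative: the class names, field names and example constructs are hypothetical bookkeeping devices, and the assumption that the VbR DNA binding domain comes from bR itself is made only for the example.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Domain:
    kind: str    # "LBD+hinge", "DBD", or "activation"
    source: str  # receptor or protein the domain is taken from


@dataclass(frozen=True)
class ReceptorPeptide:
    lbd_hinge: Domain
    dbd: Domain
    activation: Optional[Domain] = None

    def is_first_form(self) -> bool:
        """First form: LBD/hinge, DBD, and an activation domain."""
        return self.activation is not None

    def is_second_form(self) -> bool:
        """Second form: DBD from a different member than the LBD/hinge."""
        return self.dbd.source != self.lbd_hinge.source


# Hypothetical examples; domain sources are assumptions for illustration.
vbr = ReceptorPeptide(
    lbd_hinge=Domain("LBD+hinge", "Bombyx mori receptor"),
    dbd=Domain("DBD", "Bombyx mori receptor"),
    activation=Domain("activation", "VP16"),
)
chimera = ReceptorPeptide(
    lbd_hinge=Domain("LBD+hinge", "Bombyx mori receptor"),
    dbd=Domain("DBD", "GAL4"),
)
print(vbr.is_first_form(), vbr.is_second_form())          # True False
print(chimera.is_first_form(), chimera.is_second_form())  # False True
```

The model captures only which domains are present and where they come from; the functional requirement that the peptide act without an exogenous dimer partner is an experimental property and is not represented.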



In accordance with another embodiment of the present
invention, there are provided method(s) employing such
constructs. For example, there are provided method(s) for
modulating the transcription of exogenous nucleic acid(s)
in a host containing:

(i) a nucleic acid construct comprising a promoter
and said exogenous nucleic acid(s) under the
control of a regulatory element; and

(ii) a receptor peptide comprising a DNA binding
domain, and the ligand binding domain and hinge
region of a non-mammalian member of the nuclear
receptor superfamily which is not normally
present in the cells of said host, wherein said
receptor peptide activates said regulatory
element in the absence of an exogenous dimer
partner therefor and in the presence of a ligand
for said ligand binding domain;

said method comprising administering to said host an
amount of ligand effective to modulate the transcription of
said exogenous nucleic acid(s); wherein said ligand is not
normally present in the cells of said host.



As employed herein, the terms "modulate" and
"modulating" refer to the ability of a given
ligand/receptor peptide complex to activate/deactivate
and/or up-regulate/down-regulate transcription of exogenous
nucleic acids, relative to the transactivation activity of
said receptor peptide in the absence of ligand. The actual
effect of complex formation on the transactivation activity
of a receptor peptide will vary depending on the specific
ligand and DNA binding domains employed in the receptor
peptide and on the regulatory element with which the
ligand/receptor peptide complex interacts.

As employed herein, the term "host" refers to the
cell, tissue, organ or organism in need of transcriptional
regulation of exogenous or endogenous nucleic acids.
Preferably, hosts are mammalian or mammalian derived cells
or tissue. Exemplary mammals include: humans; domesticated
animals, e.g., rat, mouse, rabbit, canine, and feline; farm
animals, e.g., chicken, bovine, porcine and ovine; and
animals of zoological interest, e.g., monkey and baboon;
and the like.



As employed herein, a "dimer partner" refers to
members of the nuclear receptor superfamily to which other
members preferentially bind to form heterodimeric species.
For example, members of the nuclear receptor superfamily
preferentially form heterodimers with a common partner, the
retinoid X (or 9-cis retinoic acid) receptor (RXR, see, for
example, Yu et al. (1991) Cell 67:1251-1266; Bugge et al.
(1992) EMBO J. 11:1409-1418; Kliewer et al. (1992) Nature
355:446-449; Leid et al. (1992) Cell 68:377-395; Marks et
al. (1992) EMBO J. 11:1419-1435; Zhang et al. (1992) Nature
355:441-446; Issemann et al. (1993) Biochimie 75:251-256).
Additional dimer partners for members of the nuclear
receptor superfamily include ultraspiracle (Usp), farnesoid
X receptor (FXR), and the like.

Receptor peptides utilized in the present invention
are characterized as being fully functional in mammalian
cells without the addition of any exogenous dimer partners
therefor. For such receptor peptides, the presence of a
native level or concentration of endogenous dimer partner
is sufficient to promote transcription. As employed herein,
the phrase "exogenous dimer partner" refers to a dimer
partner for the receptor peptide that is introduced into
the host. Exogenous dimer partner may be employed because
the dimer partner is not present in the host cell (i.e.,
Usp) or because the dimer partner is required at elevated
levels in the host (i.e., higher levels of RXR).
Conversely, the phrase "endogenous dimer partner(s)" refers
to dimer partners which are native to the unmodified
mammalian host cell. Invention receptors are advantageous
because dimer partner is not necessary at elevated levels
in order for formation of heterodimers to occur, thereby
promoting transcription.



Members of the nuclear receptor superfamily are
characterized by the presence of five domains: A/B, C, D,
E and F (Evans, R. Science 240:889-895 (1988)), wherein "C"
corresponds to the DNA binding domain, "D" corresponds to
the hinge region, and "E" corresponds to the ligand binding
domain. In accordance with a particular aspect of the
invention, it has been discovered that the hinge region
(D), in coordination with the ligand binding domain (E) of
certain non-mammalian members of the nuclear receptor
superfamily, enables such members to interact with dimer
partners endogenous to mammalian cells, i.e., in the
absence of and without requiring introduction of exogenous
dimer partners. Specifically, it has been identified that
regions within the hinge region ("D") and the ligand
binding domain ("E") of the Bombyx mori receptor function
in concert to produce high affinity, ligand-dependent
heterodimerization between bR and mammalian dimer partners,
without requiring the introduction of exogenous dimer
partners such as Usp.

Thus, in accordance with a preferred embodiment of the
present invention, the invention receptor peptide comprises
the hinge region and ligand binding domain derived from
insect receptors of lepidopteran species such as Bombyx
mori (Swevers et al. Insect Biochem. Molec. Biol.
25(7):857-866 (1995)), Choristoneura fumiferana (Palli et
al. Insect Biochem. Molec. Biol. 26(5):485-499 (1996)),
Manduca sexta (Fujiwara et al. Insect Biochem. Molec. Biol.
25(7):845-856 (1995)), Aedes aegypti (Cho et al. Insect
Biochem. Molec. Biol. 25:19-27 (1995)), Chironomus tentans
(Imhof et al. Insect Biochem. Molec. Biol. 25:115-124
(1993)), and the like. Presently preferred insect receptors
from which the hinge region and/or ligand binding domain is
derived substantially lack the C-terminal "F" domain. In
addition, presently preferred insect receptors share less
than 50% sequence similarity with Drosophila ecdysone
receptors, as determined by Higgins and Sharp
sequence alignment.
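To make the 50% threshold concrete, the sketch below computes a simple percent identity over two sequences that have already been aligned. This is a deliberate simplification: it counts exact matches only, it is not the Higgins and Sharp alignment procedure itself, and the toy sequences are invented for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned (non-gap) columns in which both residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # skip columns that contain a gap
        compared += 1
        matches += a == b
    if compared == 0:
        raise ValueError("no aligned residue pairs to compare")
    return 100.0 * matches / compared


# Toy, pre-aligned sequences ("-" marks a gap); not real receptor data.
score = percent_identity("ACDE-FGHI", "ACNQKFTRL")
print(score)          # 37.5
print(score < 50.0)   # True
```

A similarity measure, as opposed to identity, would additionally credit conservative substitutions through a scoring matrix, so the value obtained here is at most a lower bound on similarity.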

As employed herein, the term "hinge region" or "D
domain" refers to the domain located between the DNA
binding domain and the ligand binding domain of intact
members of the nuclear receptor superfamily. In accordance
with a preferred embodiment of the present invention, the
hinge region facilitates high affinity interaction with
dimer partners endogenous to mammalian cells, also referred
to herein as "endogenous dimer partners." Preferably, the
hinge region of a non-mammalian member of the nuclear
receptor superfamily can be characterized as any sequence
having substantial sequence identity with amino acid
residues 273-362 set forth in SEQ ID NO:3, or substantial
portions thereof which confer sufficiently high affinity
for endogenous dimer partner.

The hinge region can be functionally located in either
orientation and at various positions within the receptor
peptide. For example, the hinge region can be positioned
at either the amino or carboxy terminus of the receptor
peptide, or therebetween. In a preferred embodiment of the
present invention, the hinge region is positioned
internally between the ligand binding and DNA binding
domains of the receptor peptide (see Figure 1A).

As employed herein, the phrase "ligand binding domain
of a non-mammalian member of the nuclear receptor
superfamily" refers to ligand binding domains derived from
receptors which are not endogenous to a mammalian host.
Ligand binding domains which are not endogenous to a
mammalian host include ligand binding domains which are
modified to be non-responsive to ligands endogenous or
native to the host. Ligand binding domains contemplated
for use according to the present invention can be derived
from non-mammalian member(s) of the nuclear receptor
superfamily which members are not normally present in the
cells of a host. Ligand binding domains are preferably
derived from the carboxy-terminal portion of non-mammalian
receptors which are capable of activating transcription of
regulatory elements in the absence of exogenous dimer
partner therefor. Exemplary receptors which are not
normally present in mammalian cells include insect
receptors, plant receptors, and the like.

Ligand binding domains can be functionally located in
either orientation and at various positions within the
receptor peptide. For example, the ligand binding domain
can be positioned at either the amino or carboxy terminus
of the receptor peptide, or therebetween. In a preferred
embodiment of the present invention, the ligand binding
domain is positioned at the carboxy terminus of the
receptor peptide (see Figure 1A).

Exemplary ligand binding domains can alternatively be
characterized as comprising substantial sequence identity
with amino acid residues 353-508 ("E2"), preferably amino
acid residues 393-586 (E2 and E3, wherein "E3" is amino acid
residues 509-586), as set forth in SEQ ID NO:3, or
substantial portions thereof (i.e., typically at least 46
or more contiguous nucleotides thereof). It has been
determined that the specific ligand response determinant of
Bombyx lies within the E2 region of the native bR, whereas
both E2 and E3 regions facilitate heterodimer formation with
endogenous dimer partners. Modifications of this sequence
contemplated for use in the practice of the present
invention include replacing several amino acids of the
ligand binding domain with sequences from ligand binding
domains of other members of the nuclear receptor
superfamily, such as retinoic acid receptor and/or thyroid
hormone receptor (Figure 1B). These modifications provide
unique transactivating characteristics and/or eliminate
restriction sites, which facilitate the construction of
useful peptide-based viral constructs.

DNA-binding domains contemplated for use in the
preparation of invention receptor peptides are typically
obtained from DNA-binding proteins (e.g., transcription
factors). The term "DNA-binding domain" is understood in
the art to refer to an amino acid sequence that is able to
bind to DNA. As used herein, the term "DNA-binding domain"
encompasses a minimal peptide sequence of a DNA-binding
protein up to the entire length of a DNA-binding protein,
so long as the DNA-binding domain functions to associate
with a particular regulatory element.

DNA-binding domains are known to function
heterologously in combination with other functional domains
by maintaining the ability to bind DNA recognition sequences
(see, e.g., Brent and Ptashne, (1985) Cell, 43:729-736).
For example, with respect to hormone receptors, DNA-binding
domains are interchangeable, thereby providing numerous
chimeric receptor proteins (see, e.g., U.S. Patent
4,981,784; and Evans, R., (1988) Science, 240:889-895).
Similar to the ligand binding domain, the DNA-binding
domain can be positioned at either the carboxy terminus or
the amino terminus, or the DNA-binding domain can be
positioned between the ligand binding domain and the
activation domain. In preferred embodiments of the present
invention, the DNA-binding domain is positioned internally
between the ligand binding domain and the activation
domain.

"DNA-binding protein(s)" contemplated for use herein
belong to the well-known class of proteins that are able to
directly bind DNA and facilitate initiation or repression
of transcription. Exemplary DNA-binding proteins
contemplated for use herein include transcription control
proteins (e.g., transcription factors and the like; Conaway
and Conaway, 1994, "Transcription Mechanisms and
Regulation", Raven Press Series on Molecular and Cellular
Biology, Vol. 3, Raven Press, Ltd., New York, NY).



Transcription factors contemplated for use herein as
a source of such DNA binding domains include, e.g.,
homeobox proteins, zinc finger proteins, hormone receptors,
helix-turn-helix proteins, helix-loop-helix proteins,
basic-Zip proteins (bZip), β-ribbon factors, and the like.
See, for example, Harrison, S., "A Structural Taxonomy of
DNA-binding Domains," Nature, 353:715-719. Homeobox
DNA-binding proteins suitable for use herein include, for
example, HOX, STF-1 (Leonard et al., 1993, Mol. Endo.,
7:1275-1283), Antp, Mat α-2, INV, and the like. See, also,
Scott et al. (1989), Biochem. Biophys. Acta, 989:25-48. It
has been found that a fragment of 76 amino acids
(corresponding to amino acids 140-215 described in Leonard
et al., (1993) Mol. Endo., 7:1275-1283) containing the
STF-1 homeodomain binds DNA as tightly as wild-type STF-1.
Suitable zinc finger DNA-binding proteins for use herein
include Zif268, GLI, XFin, and the like. See also, Klug
and Rhodes (1987) Trends Biochem. Sci., 12:464; Jacobs and
Michaels (1990) New Biol., 2:583; and Jacobs (1992), EMBO
J., 11:4507-4517.

An additional DNA binding domain contemplated for use
in the practice of the present invention is the GAL4 DNA
binding domain. The DNA binding domain of the yeast GAL4
protein comprises at least the first 74 amino terminal
amino acids thereof (see, for example, Keegan et al.,
Science 231:699-704 (1986)). Preferably, the first 90 or
more amino terminal amino acids of the GAL4 protein will be
used, with the 147 amino terminal amino acid residues of
yeast GAL4 being presently most preferred.

Preferably, DNA-binding domain(s) used herein is (are)
obtained from a member of the nuclear receptor superfamily.
As used herein, the phrase "member(s) of the nuclear
receptor superfamily" (also known as "intracellular
receptors" or "steroid/thyroid hormone superfamily of
receptors") refers to hormone binding proteins that operate
as ligand-dependent transcription factors, including
identified members of the nuclear receptor superfamily for
which specific ligands have not yet been identified
(referred to hereinafter as "orphan receptors").

Exemplary members of the steroid/thyroid hormone
superfamily of receptors (including the various isoforms
thereof) include steroid receptors such as glucocorticoid
receptor (GR), mineralocorticoid receptor (MR), estrogen
receptor (ER), progesterone receptor (PR), androgen
receptor (AR), vitamin D3 receptor (VDR), various isoforms
of peroxisome proliferator-activated receptors (PPARs), and
the like; plus retinoid receptors, such as the various
isoforms of retinoic acid receptor (e.g., RARα, RARβ, or
RARγ), the various isoforms of retinoid X receptor (e.g.,
RXRα, RXRβ, or RXRγ), and the like (see, e.g., U.S. Patent
Nos. 4,981,784; 5,171,671; and 5,071,773); thyroid
receptors (TR), such as TRα, TRβ, and the like; insect
derived receptors such as the ecdysone receptor, and the
like; as well as other gene products which, by their
structure and properties, are considered to be members of
the superfamily, as defined hereinabove, including the
various isoforms thereof. Examples of orphan receptors
contemplated for use herein as a source of DNA binding
domain include HNF4 (see, for example, Sladek et al., in
Genes & Development 4: 2353-2365 (1990)), the COUP family
of receptors (see, for example, Miyajima et al., in Nucleic
Acids Research 16: 11057-11074 (1988), and Wang et al., in
Nature 340: 163-166 (1989)), COUP-like receptors and COUP
homologs, such as those described by Mlodzik et al., in
Cell 60: 211-224 (1990) and Ladias et al., in Science 251:
561-565 (1991), the insect derived knirps and
knirps-related receptors, and the like.
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The DNA-binding domains of all members of the nucleai' 
receptor superfamily are related. Such domains consist of 
66-68 amino acid residues, and possess about 20 invariant 
amino acid residues, including nine cysteines." Members of 
5 the superfamily are characterized as proteins which contain 
these 20 invariant amino acid residues. The highly 
conserved amino acids of the DNA-binding domain of members 
of the superfamily are as follows: 

Cys - X - X - Cys - X - X - Asp* - X - Ala* - X -
Gly* - X - Tyr* - X - X - X - X - Cys - X - X -
Cys - Lys* - X - Phe - Phe - X - Arg* - X - X -
X - X - X - X - X - X - X - (X - X -) Cys - X - X
- X - X - X - (X - X - X -) Cys - X - X - X - Lys
- X - X - Arg - X - X - Cys - X - X - Cys - Arg*
- X - X - Lys* - Cys - X - X - X - Gly* - Met
(SEQ ID NO:1);

wherein X designates non-conserved amino acids within the
DNA-binding domain; an asterisk denotes the amino acid
residues which are almost universally conserved, but for
which variations have been found in some identified hormone
receptors; and the residues enclosed in parentheses are
optional residues (thus, the DNA-binding domain is a
minimum of 66 amino acids in length, but can contain
several additional residues).
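
By way of illustration only, the consensus set forth above can be
transcribed into a regular expression and used to scan a protein
sequence. The sketch below is not part of the original disclosure; it
assumes that every named residue, including the asterisked ones, is
strictly conserved (which is stricter than the text, since variations
at asterisked positions are known), and the helper names `find_dbd`
and `synthetic_dbd` are hypothetical.

```python
import re

# Literal transcription of the consensus printed above (SEQ ID NO:1).
# "." stands for X (any residue); the two parenthesised groups are optional.
# Assumption: asterisked residues are treated as invariant here.
DBD_CONSENSUS = re.compile(
    r"C..C..D.A.G.Y.{4}C..CK.FF.R.{9}(?:..)?"
    r"C.{5}(?:...)?C.{3}K..R..C..CR..KC.{3}GM"
)


def find_dbd(protein):
    """Return (start, end) of the first consensus match, or None."""
    match = DBD_CONSENSUS.search(protein.upper())
    return (match.start(), match.end()) if match else None


def synthetic_dbd(optional_first=False, optional_second=False, filler="S"):
    """Build an artificial sequence from the consensus (X replaced by filler).

    This is a made-up test sequence, not a real receptor domain.
    """
    x = filler
    parts = [
        "C", x * 2, "C", x * 2, "D", x, "A", x, "G", x, "Y",
        x * 4, "C", x * 2, "CK", x, "FF", x, "R", x * 9,
        x * 2 if optional_first else "",
        "C", x * 5,
        x * 3 if optional_second else "",
        "C", x * 3, "K", x * 2, "R", x * 2, "C", x * 2, "CR",
        x * 2, "KC", x * 3, "GM",
    ]
    return "".join(parts)


if __name__ == "__main__":
    minimal = synthetic_dbd()
    print(len(minimal), minimal.count("C"))   # minimum length, cysteine count
    print(find_dbd("MM" + minimal + "KK"))    # location inside a longer chain
```

A real receptor sequence may fail this strict pattern at one or more
asterisked positions; relaxing those positions to "." is one way to
adapt the sketch.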

It has also been found that in vitro evolution methods
can be applied to modify and improve existing DNA-binding
domains (see, e.g., Devlin et al., 1990, Science, 249:404-406;
and Scott and Smith, 1990, Science, 249:386-390).
Alternatively, DNA-binding domains which are engineered
with novel DNA-recognition specificity (see, e.g.,
Pomerantz et al., Science 267:93-96, 1995, ZFHD1, an
engineered transcription factor with a composite
DNA-binding domain) are also contemplated.

Two polynucleotides or polypeptides are said to be
"identical" if the sequence of nucleotides or amino acid
residues in the two sequences is the same when aligned for
maximum correspondence. Optimal alignment of sequences for
comparison may be conducted by the local homology algorithm
of Smith and Waterman, Adv. Appl. Math., 2:482 (1981), by
the homology alignment algorithm of Needleman and Wunsch,
J. Mol. Biol., 48:443 (1970), by the search for similarity
method of Pearson and Lipman, Proc. Natl. Acad. Sci.
(U.S.A.), 85:2444 (1988), by computerized implementations
of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the
Wisconsin Genetics Software Package, Genetics Computer
Group, 575 Science Dr., Madison, Wis.), or by inspection.
These references are incorporated herein by reference.

The percentage of sequence identity between two
sequences is determined by comparing two optimally aligned
sequences over a window of comparison of at least 20
positions. The percentage is calculated by determining the
number of positions at which the identical nucleic acid
base or amino acid residue occurs in both sequences to
yield the number of matched positions, dividing the number
of matched positions by the total number of positions in
the window of comparison (i.e., the window size), and
multiplying the result by 100 to yield the percentage of
sequence identity.
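
The calculation just described can be sketched as follows. This is a
minimal illustration, not the GAP program: it assumes the two
sequences have already been optimally aligned to equal length, that
"-" marks an alignment gap (an assumption, since the text does not
name a gap symbol), and that a gap never counts as a match. The
function name `percent_identity` is hypothetical.

```python
def percent_identity(aligned_a, aligned_b, min_window=20):
    """Percent identity over a comparison window of pre-aligned sequences.

    matched positions / window size * 100, as described in the text.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    window = len(aligned_a)
    if window < min_window:
        raise ValueError("comparison window must span at least 20 positions")
    # Count positions carrying the identical base or residue in both
    # sequences; a gap character is never treated as a match (assumption).
    matched = sum(
        1 for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matched / window


if __name__ == "__main__":
    seq_a = "ACGTACGTACGTACGTACGT"
    seq_b = "ACGTACGTACGTACGTACGA"   # differs at the last position only
    print(percent_identity(seq_a, seq_b))
```

Here 19 of 20 positions match, so the two example sequences share 95%
identity over the window.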

For instance, a preferred method for comparing
sequences uses the GAP program based on the algorithm of
Needleman et al., supra. Typically, the default values for
all parameters are selected. These are gap weight: 5.0,
length weight: 0.30, average match: 1.0, and average
mismatch: 0.0.

As used herein, the phrase "substantial sequence
identity" refers to nucleotide or amino acid sequences
which share at least 80% sequence identity, preferably 90%,
more preferably 95% or more, regardless of the algorithm
used to determine sequence identity, compared to a
reference sequence over a comparison window of about 20 bp
to about 2000 bp, typically about 50 to about 1500 bp,
usually about 350 bp to about 1200 bp. The values of percent
identity are preferably determined using the GAP program,
referred to above. Another indication that nucleotide
sequences are substantially identical is if two molecules
hybridize to each other under stringent conditions. It is
recognized, however, that proteins (and DNA or mRNA
encoding such proteins) containing less than the
above-described level of homology produced as splice variants or
as a result of conservative amino acid substitutions (or
substitution of degenerate codons) are contemplated to be
within the scope of the present invention.

The phrase "substantially the same," used in reference
to a DNA nucleotide sequence, an RNA ribonucleotide
sequence, or an amino acid sequence, refers to sequences
that have slight and non-consequential sequence variations
from the actual sequences disclosed herein. Species that
are substantially the same are considered to be equivalent
to the disclosed sequences. In this regard, "slight and
non-consequential sequence variations" mean that sequences
that are substantially the same as the DNA, RNA, or
proteins disclosed and claimed herein are functionally
equivalent to the sequences disclosed and claimed herein.
Functionally equivalent sequences will function in
substantially the same manner to produce substantially the
same effect as the sequences disclosed.

Receptor peptides employed in the present invention
can be modified by the introduction of activation domains.
Activation domains contemplated for use in the practice of
the present invention are well known in the art and can
readily be identified by those of skill in the art.
Activation domains contemplated for use herein are
typically derived from transcription factors and comprise
a contiguous sequence that functions to activate gene
expression when associated with a suitable DNA-binding
domain and a suitable ligand binding domain. An activation
domain can be positioned at any convenient site within the
receptor peptide, i.e., at the carboxy terminus, the amino
terminus or between the ligand binding domain and the DNA
binding domain. In preferred embodiments of the present
invention, the activation domain is positioned at the amino
terminus of the receptor peptide.

Suitable activation domains can be obtained from a
variety of sources, e.g., from the N-terminal region of a
member of the steroid/thyroid hormone superfamily of
receptors, from a transcription factor activation domain,
such as, for example, VP16, GAL4, NF-κB or BP64 activation
domains, and the like. The presently most preferred
activation domain contemplated for use in the practice of
the present invention is obtained from the N-terminal
region of the VP16 protein.

In a particular embodiment of the present invention,
the invention nucleic acid construct further comprises
promoters and regulatory elements operatively associated
with exogenous nucleic acids. In a preferred embodiment of
the present invention, receptor peptide, in the presence of
a ligand therefor, binds the regulatory element and
activates transcription of the exogenous nucleic acids.

Regulatory elements contemplated for use in the
practice of the present invention include elements
responsive to the invention receptor peptide. In a
preferred embodiment of the present invention, such
elements are exogenous regulatory elements not normally
present in the cells of the host. One class of exogenous
regulatory elements contemplated for use herein includes
hormone response elements which modulate transcription of
exogenous nucleic acid when bound to the DNA binding domain
of an invention receptor peptide.



Additional regulatory elements that may be utilized in
the practice of the present invention include exogenous
regulatory elements responsive to a non-mammalian
transactivator. One such transactivator-responsive
regulatory element is an operator which confers
responsiveness to antibiotics. Exemplary operators
contemplated for use in this aspect of the invention
include the tetracycline-analog regulated operator, the TET
operator, the Lac operator, and the like.

Exogenous response elements contemplated for use
herein are short cis-acting sequences (i.e., having about
12-20 bp) that are required for activation of transcription
in response to association with a suitable ligand, such as
diacyl hydrazines, and an invention receptor peptide.
Response element sequences contemplated for use herein
function in a position- and orientation-independent
fashion. Exemplary response elements include hormone
response elements, GAL4 response elements and the like.



Hormone response elements contemplated for use in the
practice of the present invention are response elements
which are responsive to members of the nuclear receptor
superfamily. These response elements comprise at least two
half-sites (in either direct repeat or inverted repeat
orientation to one another), separated by a spacer of 0-5
nucleotides. As used herein, the term "half-site" refers
to a contiguous 6 nucleotide sequence that is bound by a
particular member of the nuclear receptor superfamily.
Typically, two half-sites with a corresponding spacer make
up a hormone response element. Hormone response elements
can be incorporated in multiple copies into various
transcription regulatory regions.

Preferred hormone response elements employed in the
practice of the present invention comprise a first
half-site and a second half-site, separated by a spacer of
0-5 nucleotides;

wherein each half-site has the sequence:

-RGBNNM-,
(or complements thereof) wherein

each R is independently selected from A or G;
each B is independently selected from G, C, or T;
each N is independently selected from A, T, C, or G;
and
each M is independently selected from A or C;
with the proviso that at least 4 nucleotides of each
-RGBNNM- group of nucleotides are identical with the
nucleotides at comparable positions of the sequence
-AGGTCA-.
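
The half-site definition above lends itself to a mechanical check.
The following sketch is offered for illustration only; the function
name `is_half_site` is hypothetical, and the "complements thereof"
alternative is not handled (a candidate on the opposite strand would
first need to be reverse-complemented by the caller).

```python
# Allowed bases at each position of the -RGBNNM- motif, as defined above.
ALLOWED = {
    "R": "AG",     # A or G
    "G": "G",      # fixed G
    "B": "GCT",    # G, C, or T
    "N": "ATCG",   # any nucleotide
    "M": "AC",     # A or C
}
MOTIF = "RGBNNM"
REFERENCE = "AGGTCA"


def is_half_site(site, min_reference_matches=4):
    """True if `site` fits -RGBNNM- and shares at least 4 positions with AGGTCA."""
    site = site.upper()
    if len(site) != len(MOTIF):
        return False
    fits_motif = all(base in ALLOWED[code] for base, code in zip(site, MOTIF))
    reference_matches = sum(a == b for a, b in zip(site, REFERENCE))
    return fits_motif and reference_matches >= min_reference_matches


if __name__ == "__main__":
    for candidate in ("AGGTCA", "AGTGCA", "GGTTCA", "AGGTCT", "GGTTAA"):
        print(candidate, is_half_site(candidate))
```

In this sketch AGGTCT is rejected because its last base is not A or C,
and GGTTAA is rejected because it fits the motif but matches AGGTCA at
only three positions.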

Exemplary half-sites having the -RGBNNM- motif for use
in preparing response elements useful in the practice of
the present invention include, for example, half-sites
selected from -AGGGCA-, -AGTTCA-, -AGGTAA-, -AGGTCA-,
-GGTTCA-, -GGGTTA-, -GGGTGA-, -AGGTGA-, -GGGTCA-, and the
like. A particularly preferred first half-site is
-AGTGCA-.
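
As a further illustration, two half-sites and a spacer of 0-5
nucleotides can be assembled into a response element in either direct
repeat or inverted repeat orientation. The sketch below is an
assumption-laden example rather than a construct taken from this
specification: the spacer sequence "CAGG" is arbitrary, and the
function names are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def response_element(first, second, spacer="", inverted=False):
    """Join two 6-nucleotide half-sites with a spacer of 0-5 nucleotides.

    inverted=False gives a direct repeat; inverted=True places the second
    half-site on the opposite strand (inverted repeat).
    """
    if len(first) != 6 or len(second) != 6:
        raise ValueError("each half-site must be 6 nucleotides long")
    if not 0 <= len(spacer) <= 5:
        raise ValueError("spacer must be 0-5 nucleotides long")
    second_site = reverse_complement(second) if inverted else second.upper()
    return first.upper() + spacer.upper() + second_site


if __name__ == "__main__":
    # Direct repeat of AGGTCA with an arbitrary 4-nucleotide spacer.
    print(response_element("AGGTCA", "AGGTCA", spacer="CAGG"))
    # Inverted repeat of AGGTCA with no spacer.
    print(response_element("AGGTCA", "AGGTCA", inverted=True))
```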

Additional response elements include the GAL4
response element. Exemplary GAL4 response elements are
those containing the palindromic 17-mer:

5'-CGGAGGACTGTCCTCCG-3' (SEQ ID NO:4),

such as, for example, 17MX, as described by Webster et al.,
in Cell 52:165-176 (1988), as well as derivatives thereof.
Additional examples of suitable response elements include
those described by Hollenberg and Evans in Cell 55:895-906
(1988); or Webster et al. in Cell 54:199-207 (1988).
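
The palindromic character of the 17-mer of SEQ ID NO:4 can be
verified computationally. Because the sequence has an odd length, the
sketch below (illustrative only, with a hypothetical function name)
compares the two 8-nucleotide arms flanking the central base rather
than requiring the whole sequence to equal its reverse complement.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def arms_are_palindromic(seq):
    """True if the left arm equals the reverse complement of the right arm.

    For an odd-length sequence the central base is ignored.
    """
    seq = seq.upper()
    arm = len(seq) // 2
    left = seq[:arm]
    right = seq[len(seq) - arm:]
    return left == reverse_complement(right)


if __name__ == "__main__":
    gal4_17mer = "CGGAGGACTGTCCTCCG"   # SEQ ID NO:4 as printed above
    print(len(gal4_17mer), arms_are_palindromic(gal4_17mer))
```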

Regulatory elements employed in the practice of the
present invention are operably linked to a suitable
promoter for transcription of exogenous nucleic acid(s)
product(s). When exogenous nucleic acid(s), operatively
linked to a suitable promoter, is (are) introduced into the
cells of a suitable host, expression of the exogenous
nucleic acid(s) is (are) controlled by the presence of
ligands, which are not normally present in the host cells.

As used herein, when referring to nucleic acids, the
phrase "exogenous to said mammalian host" or simply
"exogenous" refers to nucleic acids not naturally found at
levels sufficient to provide a function in the particular
cell where transcription is desired. For example,
exogenous nucleic acids can be either natural or synthetic
nucleic acids, which are introduced into the host in the
form of DNA or RNA. The nucleic acids of interest can be
introduced into target cells (for in vitro applications),
or the nucleic acids of interest can be introduced directly
or indirectly into a host by the transfer of transformed
cells into a host.

In contrast to exogenous nucleic acids, the phrase
"endogenous nucleic acids" or "endogenous genes" refers to
nucleic acids naturally found at levels sufficient to
provide a function in the particular cell where
transcription is desired.

Exogenous nucleic acids contemplated for use in the
practice of the present invention include wild type and/or
therapeutic nucleic acids.

"Wild type" genes are those that are native to cells
of a particular type. Exemplary wild type nucleic acids
are genes which encode products:

the substantial absence of which leads to the
occurrence of a non-normal state in a host; or

a substantial excess of which leads to the
occurrence of a non-normal state in a host.

Such genes may not be expressed in biologically significant
levels or may be undesirably overexpressed, respectively.
Thus, for example, while a synthetic or natural gene coding
for human insulin would be exogenous genetic material to a
yeast cell (since yeast cells do not naturally contain
insulin genes), a human insulin gene inserted into a human
skin fibroblast cell would be a wild type gene with respect
to the fibroblast since human skin fibroblasts contain
genetic material encoding human insulin, although human
skin fibroblasts do not express human insulin in
biologically significant levels.



Therapeutic nucleic acids contemplated for use in the
practice of the present invention include those which:

encode products which are toxic to the cells in
which they are expressed; or

encode products which impart a beneficial
property to a host; or

transcribe nucleic acids which modulate
transcription and/or translation of endogenous genes.

As employed herein, the phrase "therapeutic nucleic
acids" refers to nucleic acids which impart a beneficial
function to the host in which such nucleic acids are
transcribed. Therapeutic nucleic acids are those that are
not naturally found in host cells. For example, synthetic
or natural nucleic acids coding for wild type human insulin
would be therapeutic when inserted into a skin fibroblast
cell so as to be expressed in a human host, where the human
host is not otherwise capable of expressing functionally
active human insulin in biologically significant levels.
Further examples of therapeutic nucleic acids include
nucleic acids which transcribe antisense constructs used to
suppress the expression of endogenous genes. Such
antisense transcripts bind endogenous nucleic acid (mRNA or
DNA) and effectively cancel out the expression of the gene.
In accordance with the methods described herein,
therapeutic nucleic acids are expressed at a level that
provides a therapeutically effective amount of the
corresponding therapeutic protein.

Exogenous nucleic acids useful in the practice of the
present invention include genes that encode biologically
active proteins of interest, such as, e.g., secretory
proteins that can be released from said cell; enzymes that
can metabolize a toxic substance to produce a non-toxic
substance, or that metabolize an inactive substance to
produce a useful substance; regulatory proteins; cell
surface receptors; and the like. Useful genes include
genes that encode blood clotting factors such as human
factors VIII and IX; genes that encode hormones such as
insulin, parathyroid hormone, luteinizing hormone releasing
factor (LHRH), alpha and beta seminal inhibins, and human
growth hormone; genes that encode proteins such as enzymes,
the absence of which leads to the occurrence of an abnormal
state; genes encoding cytokines or lymphokines such as
interferons, granulocytic macrophage colony stimulating
factor (GM-CSF), colony stimulating factor-1 (CSF-1), tumor
necrosis factor (TNF), and erythropoietin (EPO); genes
encoding inhibitor substances such as alpha1-antitrypsin;
genes or nucleotides which characterize specific conditions
or diseases (i.e., CAG repeat expansion and concomitant
polyglutamine (polyQ) expression has been linked to a
variety of neurodegenerative conditions including
Huntington's disease, dentatorubro-pallidoluysian atrophy,
spinocerebellar ataxias, and spinobulbar muscular atrophy,
and the like); genes encoding substances that function as
drugs, e.g., genes encoding the diphtheria and cholera
toxins; and the like.



Additional nucleic acids contemplated for use in
accordance with the present invention include genes which
encode proteins present in dopaminergic neurons (useful,
for example, for the treatment of Parkinson's disease),
cholinergic neurons (useful, for example, for the treatment
of Alzheimer's disease), hippocampal pyramidal neurons
(also useful for the treatment of Alzheimer's disease),
norepinephrine neurons (useful, for example, for the
treatment of epilepsy), spinal neurons (useful, for
example, for the treatment of spinal injury), glutamatergic
neurons (useful, for example, for the treatment of
schizophrenia), cortical neurons (useful, for example, for
the treatment of stroke and brain injury), motor and
sensory neurons (useful, for example, for the treatment of
amyotrophic lateral sclerosis), and the like.

Typically, nucleic acid sequence information for
proteins encoded by exogenous nucleic acid(s) contemplated
for use herein can be located in one of many
public access databases, e.g., GENBANK, EMBL, Swiss-Prot,
and PIR, or in related journal publications. Thus, those
of skill in the art have access to sequence information for
virtually all known genes. Those of skill in the art can
obtain the corresponding nucleic acid molecule
directly from a public depository or the institution that
published the sequence. Optionally, once the nucleic acid
sequence encoding a desired protein has been ascertained,
the skilled artisan can employ routine methods, e.g.,
polymerase chain reaction (PCR) amplification, to isolate
the desired nucleic acid molecule from the appropriate
nucleic acid library. Thus, all known nucleic acids
encoding proteins of interest are available for use in the
methods and products described herein.

Additional components which can optionally be
incorporated into invention constructs include selectable
markers. Selectable markers contemplated for use in the
practice of the present invention include radiolabeled
molecules, fluorescent molecules, ligands, enzymes, and the
like. Preferable selectable markers are enzymes such as
antibiotic resistance genes, genes which enable cells to
process metabolic intermediaries, and the like. Exemplary
antibiotic resistance genes include genes which impart
tetracycline resistance, genes which impart ampicillin
resistance, zeomycin resistance, neomycin resistance,
hygromycin resistance, puromycin resistance, and the like.
Genes which enable cells to process metabolic
intermediaries include genes which permit cells to
incorporate L-histidinol, genes encoding thymidine kinase,
genes encoding xanthine-guanine phosphoribosyl transferase
(gpt), genes encoding dihydrofolate reductase, genes
encoding asparagine synthetase, and the like.

In accordance with a preferred embodiment of the
present invention, the invention nucleic acid construct
further comprises, in addition to the receptor peptide, a
non-mammalian transactivator, not a member of the nuclear
receptor superfamily and not normally present in the cells
of said host, and a compatible transactivator responsive
regulatory element not normally present in cells of said
host. The transactivator responsive regulatory element
controls transcription of the exogenous nucleic acid(s) or
a second nucleic acid construct comprising a second
exogenous nucleic acid(s). Examples of transactivator
responsive regulatory elements include operators which are
responsive to non-mammalian transactivators which confer
responsiveness to antibiotics. Exemplary operators
contemplated for use in this aspect of the invention
include the tetracycline-analog regulated operator, the TET
operator, the Lac operator, and the like. Preferably, the
transactivator responsive regulatory elements employed in
the practice of the present invention are operably linked
to a suitable promoter for transcription of exogenous
nucleic acid(s) proteins.

Non-mammalian transactivators, other than members of
the nuclear receptor superfamily, contemplated for use in
the practice of the present invention function in the
absence of exogenous dimer partner. Examples of
transactivators that typically function in the absence of
exogenous dimer partner are tetracycline-controlled
transactivators, VP16-Lac fusion transactivators, and the
like. When contained as part of a transactivating
construct, the transactivator can be positioned at any
convenient site within the construct, i.e., at the carboxy
terminus or the amino terminus of the transactivating
construct. In preferred embodiments of the present invention,
the transactivator is positioned at the amino terminus of
the transactivating construct (Figure 1C).

Preferably, the non-mammalian transactivator confers
responsiveness to antibiotics. An example of a
non-mammalian transactivator which confers responsiveness to
antibiotics, as contemplated for use in the practice of the
present invention, is the tetracycline-controlled
transactivator. The tetracycline inducible system is
well-known in the art (see, e.g., Gossen et al. (1992) PNAS 89,
5547-5551; Gossen et al. (1993) TIBS 18, 471-475; Furth et
al. (1994) PNAS 91, 9302-9306; Shockett et al. (1995) PNAS
92, 6522-6526; and Hoshimaru et al. (1996) PNAS 93(4):1518-1523).
Other examples of non-mammalian transactivators
well known in the art include the IPTG inducible system
based on a VP16-Lac repressor fusion which functions
through lac operator sequences inserted into heterologous
promoters (see, e.g., Baim et al. (1991) PNAS 86:5072-5076).

In addition to the receptors/transactivators set forth
above, those of skill in the art recognize that other
transactivators can be used herein, e.g., homeobox
proteins, zinc finger proteins, hormone receptors,
helix-turn-helix proteins, helix-loop-helix proteins, basic-Zip
proteins (bZip), β-ribbon factors, and the like. See, for
example, Harrison, S., "A Structural Taxonomy of
DNA-binding Domains," Nature, 353:715-719. Homeobox
DNA-binding proteins suitable for use herein include, for
example, HOX, STF-1 (Leonard et al., (1993) Mol. Endo.,
7:1275-1283), Antp, Mat α-2, INV, and the like. See, also,
Scott et al. (1989) Biochim. Biophys. Acta, 989:25-48. It
has been found that a fragment of 76 amino acids
(corresponding to amino acids 140-215 described in Leonard
et al., (1993) Mol. Endo., 7:1275-1283) containing the
STF-1 homeodomain binds DNA as tightly as wild-type STF-1.
Suitable zinc finger DNA-binding proteins for use herein
include Zif268, GLI, XFin, and the like. See also, Klug
and Rhodes (1987) Trends Biochem. Sci., 12:464; Jacobs and
Michaels (1990) New Biol., 2:583; and Jacobs (1992) EMBO
J., 11:4507-4517.

In a preferred embodiment of the present invention,
expression cassettes are prepared by operably linking
invention nucleic acid constructs to a suitable promoter
for expression of the encoded receptor peptide. As used
herein, the term "promoter" refers to a specific nucleotide
sequence recognized by RNA polymerase, the enzyme that
initiates RNA synthesis. The promoter sequence is the site
at which transcription can be specifically initiated under
proper conditions. When a nucleic acid construct,
operatively linked to a suitable promoter, is introduced
into the cells of a suitable host, expression of the
nucleic acid construct is conditionally or perpetually
initiated.



Promoters contemplated for use in the practice of the
present invention include inducible (e.g., minimal CMV
promoter, minimal TK promoter, modified MMLV LTR),
constitutive (e.g., chicken β-actin promoter, MMLV LTR
(non-modified), DHFR), and/or tissue specific promoters.

Inducible promoters contemplated for use in the
practice of the present invention comprise transcription
regulatory regions that function maximally to promote
transcription of mRNA under inducing conditions. Examples
of suitable inducible promoters include DNA sequences
corresponding to: the E. coli lac operator responsive to
IPTG (see Nakamura et al., Cell, 18:1109-1117, 1979); the
metallothionein promoter metal-regulatory-elements
responsive to heavy metal (e.g., zinc) induction (see Evans
et al., U.S. Patent No. 4,870,009); the phage T7lac
promoter responsive to IPTG (see Studier et al., Meth.
Enzymol., 185: 60-89, 1990; and U.S. Patent No. 4,952,496);
the heat-shock promoter; the TK minimal promoter; the CMV
minimal promoter; a synthetic promoter; and the like.

Exemplary constitutive promoters contemplated for use
in the practice of the present invention include the CMV
promoter, the SV40 promoter, the DHFR promoter, the mouse
mammary tumor virus (MMTV) steroid-inducible promoter,
Moloney murine leukemia virus (MMLV) promoter, elongation



WO 99/02683    PCT/US98/14215

factor 1α (EF1α) promoter, albumin promoter, APO A1
promoter, cyclic AMP dependent kinase II (CaMKII) promoter,
keratin promoter, CD3 promoter, immunoglobulin light or
heavy chain promoters, neurofilament promoter, neuron
specific enolase promoter, L7 promoter, CD2 promoter,
myosin light chain kinase promoter, HOX gene promoter,
thymidine kinase (TK) promoter, RNA Pol II promoter, MYOD
promoter, MYF5 promoter, phosphoglycerokinase (PGK)
promoter, Stf1 promoter, Low Density Lipoprotein (LDL)
promoter, chicken β-actin promoter (used in conjunction
with ecdysone response element) and the like.

As readily understood by those of skill in the art,
the term "tissue specific" refers to the substantially
exclusive initiation of transcription in the tissue from
which a particular promoter, which drives expression of a
given gene, is derived (e.g., expressed only in T-cells,
endothelial cells, smooth muscle cells, and the like).
Exemplary tissue specific promoters contemplated for use in
the practice of the present invention include the GH
promoter, the NSE promoter, the GFAP promoter,
neurotransmitter promoters (e.g., tyrosine hydroxylase, TH,
choline acetyltransferase, ChAT, and the like), promoters
for neurotropic factors (e.g., a nerve growth factor
promoter, NT-3, BDNF promoters, and the like), and so on.

In yet another embodiment of the present invention,
there are provided constructs comprising a promoter, a
tetracycline-controlled transactivator, a VP16 activation
domain, a DNA binding domain and Bombyx mori-derived hinge
region and ligand binding domain encoding sequence, wherein
the components of the construct are operatively associated
with the other components of the construct.

As used herein, the phrase "operatively associated
with" refers to the functional relationship of DNA with
regulatory and effector sequences of nucleotides, such as
promoters, enhancers, transcriptional and translational
stop sites, and other signal sequences. For example,
operative linkage of DNA to a promoter refers to the
physical and functional relationship between the DNA and
promoter such that the transcription of such DNA is
initiated from the promoter by an RNA polymerase that
specifically recognizes, binds to and transcribes the DNA.




In yet another embodiment of the present invention,
there are provided constructs comprising nucleic acid(s)
encoding a VP16 activation domain operatively associated
with nucleic acid encoding a DNA binding domain and Bombyx
mori-derived hinge region and ligand binding domain.

In accordance with another embodiment of the present
invention, there are provided gene transfer vectors useful
for the introduction of invention constructs into suitable
host cells. Such gene transfer vectors preferably comprise
a first reporter under the control of a regulatory element,
a second reporter under the control of an operator which is
responsive to a ligand-mediated receptor which confers
responsiveness to antibiotics, and a construct comprising
a promoter and nucleic acid encoding a VP16 activation
domain, a DNA binding domain and Bombyx mori-derived hinge
region and ligand binding domain, and, optionally, a
tetracycline-controlled transactivator. The number of
copies of regulatory elements can readily be varied by
those of skill in the art. For example, transcription
regulatory regions can contain from 1 up to about 50 copies
of a particular regulatory element, preferably 2 up to
about 25 copies, more preferably 3 up to about 10-15
copies, with about 4-6 copies being especially preferred.
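
For illustration, the copy-number ranges recited above can be applied
when laying out a tandem array of a regulatory element in silico. The
sketch below is hypothetical: the function name `tandem_array` and the
optional `linker` between copies are assumptions, and the upper bound
of 50 is treated as a hard limit although the text says "about 50".

```python
def tandem_array(element, copies, linker=""):
    """Concatenate `copies` copies of a regulatory element, 5' to 3'.

    The text contemplates from 1 up to about 50 copies, with about 4-6
    copies being especially preferred.
    """
    if not 1 <= copies <= 50:
        raise ValueError("copy number should fall between 1 and about 50")
    return linker.upper().join([element.upper()] * copies)


if __name__ == "__main__":
    gal4_17mer = "CGGAGGACTGTCCTCCG"   # SEQ ID NO:4 from this specification
    array = tandem_array(gal4_17mer, copies=4)
    print(len(array), array.count(gal4_17mer))
```

Four copies of the 17-mer without a linker give a 68-nucleotide array.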

Gene transfer vectors (also referred to as "expression
vectors") contemplated for use herein are recombinant
nucleic acid molecules that are used to transport invention
nucleic acid constructs into host cells for expression
and/or replication thereof. Expression vectors may be
either circular or linear, and are capable of incorporating
a variety of nucleic acid constructs therein. Expression
vectors typically come in the form of a plasmid that, upon
introduction into an appropriate host cell, results in
expression of the inserted nucleic acid.

Suitable expression vectors for use herein include a
recombinant DNA or RNA construct(s), such as plasmids,
phage, recombinant virus or other vectors that, upon
introduction into an appropriate host cell, result(s) in
expression of the inserted DNA. Appropriate expression
vectors are well known to those of skill in the art and
include those that are replicable in eukaryotic cells
and/or prokaryotic cells and those that remain episomal or
those which integrate into the host cell genome.

As used herein, the phrase "transcription regulatory
region" refers to that portion of a nucleic acid or gene
construct that controls the initiation of mRNA
transcription. Regulatory regions contemplated for use
herein, in the absence of the non-mammalian transactivator,
typically comprise at least a minimal promoter in
combination with a regulatory element responsive to the
ligand/receptor peptide complex. A minimal promoter, when
combined with a regulatory element, functions to initiate
mRNA transcription in response to a ligand/receptor peptide
complex. However, transcription typically will not occur
unless the required inducer (ligand therefor) is present.
Conversely, when the non-mammalian transactivator, other
than members of the nuclear receptor superfamily, is
present in the host, the transactivator-responsive
regulatory element will typically only induce transcription
in the absence of ligand therefor.

Preferably, the transcription regulatory region
further comprises a binding site for ubiquitous
transcription factor(s). Such binding sites are preferably
positioned between the promoter and the regulatory element.
Suitable ubiquitous transcription factors for use herein
are well-known in the art and include, for example, Sp1.

Expression vectors suitable for use in the practice of 
the present invention are well known to those of skill in 
the art and include those that are replicable in eukaryotic 
cells and/or prokaryotic cells as well as those that remain 
episomal and those that integrate into the host cell 
genome. Expression vectors typically further contain other 
functionally important nucleic acid sequences encoding 
antibiotic resistance proteins, and the like. 

Exemplary eukaryotic expression vectors include 
eukaryotic constructs, such as the pSV-2 gpt system 
(Mulligan et al., (1979) Nature, 277:108-114); pBluescript 
(Stratagene, La Jolla, CA), the expression cloning vector 
described by Genetics Institute (Science, (1985) 228:810- 
815), and the like. Each of these plasmid vectors is 
capable of promoting expression of the receptor peptide of 
interest. 

In a specific embodiment, a gene transfer vector 
contemplated for use herein is a viral vector, such as 
Adenovirus, adeno-associated virus, or herpes-simplex virus 
based vectors, and synthetic vectors for gene therapy, and 
the like (see, e.g., Suhr et al. (1993) Arch. of Neurol., 
50:1252-1268). Adenovirus and adeno-associated virus are 
extremely suitable as gene transfer vectors to produce 
receptor peptides. Alternatively, a gene transfer vector 
employed herein is a retroviral vector. Retroviral vectors 
are gene transfer plasmids that have an expression 
construct containing an exogenous nucleic acid residing 
between two retroviral LTRs. Retroviral vectors typically 
contain appropriate packaging signals (e.g., a retroviral 
psi packaging signal), elements which enable invention 
construct integration into a host cell (e.g., a 5' and/or 
a 3' long terminal repeat (LTR)), and/or genes encoding 
proteins required for retroviral packaging (e.g., the pol 
gene, the gag gene and the env gene), that enable the 
retroviral vector, or RNA transcribed using the retroviral 
vector as a template, to be packaged into a viral virion in 
an appropriate packaging cell line (see, e.g., U.S. Patent 
4,650,764). 

Suitable retroviral vectors for use herein are 
described, for example, in U.S. Patents 5,399,346 and 
5,252,479; and in WIPO publications WO 92/07573, WO 
90/06997, WO 89/05345, WO 92/05266 and WO 92/14829, each of 
which is hereby incorporated herein by reference, in its 
entirety. These documents provide a description of methods 
for efficiently introducing nucleic acids into human cells 
using such retroviral vectors. Other retroviral vectors 
include, for example, mouse mammary tumor virus vectors 
(e.g., Shackleford et al., (1988) PNAS, USA, 85:9655-9659), 
human immunodeficiency virus (e.g., Naldini et al. (1996) 
Science 272:165-320), and the like. 

Various procedures are also well-known in the art for 
providing helper cells which produce retroviral vector 
particles which are essentially free of replicating virus. 
See, for example, U.S. Patent 4,650,764; Miller, Human Gene 
Therapy, 1:5-14 (1990); Markowitz, et al., Journal of 
Virology, 61(4):1120-1124 (1988); Watanabe, et al., 
Molecular and Cellular Biology, 3(12):2241-2249 (1983); 
Danos, et al., PNAS, 85:6460-6464 (1988); and Bosselman, et 
al., Molecular and Cellular Biology, 7(5):1797-1806 (1987), 
which disclose procedures for producing viral vectors and 
helper cells which minimize the chances for producing a 
viral vector which includes a replicating virus. 

Recombinant retroviruses suitable for carrying out the 
invention methods are produced employing well-known methods 
for producing retroviral virions. See, for example, U.S. 
Patent 4,650,764; Miller, Human Gene Therapy, 1:5-14 
(1990); Markowitz, et al., Journal of Virology, 61(4):1120- 
1124 (1988); Watanabe, et al., Molecular and Cellular 
Biology, 3(12):2241-2249 (1983); Danos, et al., PNAS, 
85:6460-6464 (1988); and Bosselman, et al., Molecular and 
Cellular Biology, 7(5):1797-1806 (1987). 

Thus, in one embodiment, a recombinant retroviral 
vector can be utilized to express receptor peptide. 
Preferably, the retroviral vector will further comprise a 
regulatory element and exogenous nucleic acid under the 
control of the regulatory element. Optionally, the 
retroviral vector can express an antibiotic resistance gene 
(see Figure 1C). A "co-vector" can also be utilized to 
provide a nucleic acid construct comprising the promoter, 
the regulatory element and exogenous nucleic acid and a 
second antibiotic resistance gene. The co-vector carrying 
exogenous nucleic acid also has LTRs modified to promote 
high-level expression of exogenous nucleic acid(s) only in 
the presence of the receptor peptide encoded by the 
recombinant retrovirus and exogenous ligand therefor. Co- 
infected primary mammalian cells can then be selected using 
both antibiotics, resulting in a cell population that is 
dependent on ligand for high-level expression of the 
exogenous nucleic acid. 

The use of invention nucleic acid constructs allows 
the stable integration and expression of an exogenous 
nucleic acid(s) or transgene into a wide variety of 
cultured cell types. In accordance with another embodiment 
of the present invention, there are provided recombinant 
cells containing invention nucleic acid constructs encoding 
receptor peptides as described herein, optionally further 
containing regulatory elements operatively associated with 
exogenous nucleic acid(s). Alternatively, recombinant 
cells contain constructs comprising a regulatory element, 
such as the Bombyx mori receptor response element, 
operatively associated with exogenous nucleic acids, 
optionally further containing invention receptor peptides. 

The amount of exogenous nucleic acid introduced into 
a host can be varied by those of skill in the art. For 
example, when a viral vector is employed to achieve gene 
transfer, the amount of nucleic acid introduced can be 
increased by increasing the amount of plaque forming units 
(PFU) of the viral vector. 

Suitable means for introducing (transducing) 
expression vectors containing invention nucleic acid 
constructs into host cells to produce transduced 
recombinant cells (i.e., cells containing recombinant 
heterologous nucleic acid) are well-known in the art (see, 
for review, Friedmann, (1989) Science, 244:1275-1281; 
Mulligan, (1993) Science, 260:926-932, each of which is 
incorporated herein by reference in its entirety). 
Exemplary methods of transduction include, e.g., infection 
employing viral vectors (see, e.g., U.S. Patent 4,405,712 
and 4,650,764), calcium phosphate transfection (U.S. 
Patents 4,399,216 and 4,634,665), dextran sulfate 
transfection, electroporation, lipofection (see, e.g., U.S. 
Patents 4,394,448 and 4,619,794), cytofection, particle 
bead bombardment, and the like. The transduced nucleic 
acid can optionally include sequences which allow for its 
extrachromosomal (i.e., episomal) maintenance, or the 
transduced nucleic acid can be donor nucleic acid that 
integrates into the genome of the host. 



Exemplary eukaryotic cells suitable for introducing 
invention expression vectors include, e.g., CV-1 cells, P19 
cells and NT2/D1 cells (which are derived from human embryo 
carcinomas), ES cells (embryonic stem cells), COS cells, 
mouse L cells, Chinese hamster ovary (CHO) cells, primary 
human fibroblast cells, human embryonic kidney cells, 
African green monkey cells, HEK 293 (ATCC accession #CRL 
1573; U.S. Patent No. 5,024,939), Ltk- cells (ATCC accession 
#CCL1.3), COS-7 cells (ATCC under accession #CRL 1651), 
DG44 cells (dhfr- CHO cells; see, e.g., Urlaub et al. (1986) 
Cell. Molec. Genet. 12:555), cultured primary tissues, 
cultured tumor cells, neuronal progenitor or precursor 
cells, such as HCN/v-myc (a tetracycline sensitive 
retroviral vector selected by G418 resistance), neuronal 
cell lines such as cerebellum derived neuronal precursors 
and PC12 cells, neurons, primary astrocytes, 
oligodendrocytes, and the like. Presently preferred cells 
include CV-1 and 293 cells. 



Invention nucleic acid constructs may be stably 
incorporated into cells or may be transiently expressed 
using methods known in the art. Cells are cultivated under 
growth conditions (as opposed to protein expression 
conditions) until a desired density is achieved. Stably 
transfected mammalian cells may be prepared by transfecting 
cells with an expression vector having a selectable marker 
gene (such as, for example, the gene for thymidine kinase, 
dihydrofolate reductase, neomycin resistance, and the 
like), and growing the transfected cells under conditions 
selective for cells expressing the marker gene. To prepare 
transient transfectants, mammalian cells are transfected 
with a reporter gene (such as the E. coli β-galactosidase 
gene) to monitor transfection efficiency. Selectable 
marker genes are typically not included in the transient 
transfections because the transfectants are typically not 
grown under selective conditions, and are usually analyzed 
within a few days after transfection. 

In accordance with a still further embodiment of the 
present invention, there are provided transgenic animals 
and methods for producing transgenic animals capable of 
prolonged and regulated expression of exogenous nucleic 
acid(s), said method comprising introducing into early- 
stage embryos or stem cells: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein said receptor peptide 
activates said regulatory element in the absence 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain. 

As used herein, the phrase "transgenic animal" refers 
to an animal that contains one or more inheritable 
expression constructs containing one or more exogenous 
nucleic acid(s) under the transcription control of an 
operator and/or hormone response element as described 
herein. 

Methods of making transgenic animals using a 
particular nucleic acid construct are well-known in the 
art. When preparing invention transgenic animals, it is 
preferred that two transgenic lines are generated. The 
first line will express, for example, a receptor peptide as 
described above (e.g., VbR). Tissue specificity is 
conferred by the selection of a tissue-specific promoter 
(e.g., T-cell specific) that will direct expression of the 
receptor peptide to appropriate tissue. A second line 
contains a nucleic acid construct comprising a promoter and 
exogenous nucleic acid under the control of a regulatory 
element. Preferably, transgenic animals are produced by 
the transfection/infection of embryonic stem cells which 
are employed to produce invention transgenic animals 
employing methods known to those of skill in the art. 

In a presently preferred embodiment, an invention 
transgenic animal contains one or more expression 
constructs containing nucleic acid encoding receptor 
peptide and exogenous nucleic acid under the transcription 
control of a regulatory element. Thus, with tissue 
specific expression of the receptor peptide as described 
above and timely hormone treatment, inducible gene 
expression can be achieved with spatial, dosage, and 
temporal specificity. 



In yet another embodiment of the present invention, 
there are provided methods of inducing the transcription of 
an exogenous nucleic acid(s) in a host containing: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein expression of said 
receptor peptide is under the control of an 
inducible promoter, wherein said receptor peptide 
activates said regulatory element in the absence 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain; and 

(iii) said ligand for said ligand binding domain, 
wherein said ligand is not normally present 
in the cells of said host; 

said method comprising subjecting said host to 
conditions suitable to induce expression of said receptor 
peptide. 

In accordance with yet another embodiment of the 
present invention, there are provided methods for the 
expression of recombinant products detrimental to a host 
organism, said method comprising: 

transforming suitable host cells with: 

(i) a nucleic acid construct comprising a promoter 
and exogenous nucleic acid(s) which express said 
recombinant product under the control of a 
regulatory element; wherein said regulatory 
element is not normally present in the cells of 
said host, and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein said receptor peptide 
activates said regulatory element in the absence 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain, and 

growing said host cells to the desired level in the 
substantial absence of ligand for said receptor peptide; 
and 

inducing expression of said recombinant product by 
introducing into said host cells a ligand, which, in 
combination with said receptor peptide, binds to said 
regulatory element and activates transcription therefrom. 

Recombinant products detrimental to a host organism 
contemplated for expression in accordance with the present 
invention include any gene product that functions to confer 
a toxic effect on the organism. For example, inducible 
expression of a toxin such as the diphtheria toxin would 
allow for specific ablation of tissue (Ross et al. Genes 
and Development 7:1318-1324 (1993)). Moreover, the 
numerous gene products that are known to induce apoptosis 
in cells expressing such products are contemplated for use 
herein (see, e.g., Apoptosis, The Molecular Basis of Cell 
Death, Current Communications In Cell & Molecular Biology, 
Cold Spring Harbor Laboratory Press, 1991). For example, 
high level expression of 79-amino acid polyglutamine tracts 
in cells results in apoptosis and rapid cell death. 

In accordance with still another embodiment of the 
present invention, there are provided nucleic acid 
constructs and cell lines which express a polyglutamine 
expression cassette (PQEC) under the control of an 
exogenous ligand to maximize the recovery of stably 
transfected/infected cultured cells which do not undergo 
undesired apoptosis. Expression of polyglutamine tracts 
separately from the huntingtin protein facilitates 
identification of the toxic properties associated with the 
polyglutamine tracts themselves. In another embodiment of 
the present invention, there are provided methods for 
employing these cell lines to identify novel proteins that 
could target developing intracellular aggregates (IAs) and 
either block subsequent growth or facilitate or mark the 
IAs for degradation. 

In accordance with still another embodiment of the 
present invention, there are provided methods for 
modulating the transcription of nucleic acid(s) in an in 
vitro system, said method comprising administering to said 
system an amount of a ligand effective to modulate the 
transcription of said nucleic acid(s); wherein said ligand 
is not normally present in said cellular system; wherein 
said system comprises: 

(i) a nucleic acid construct comprising a promoter 
and said nucleic acid(s) under the control of a 
regulatory element; and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein said receptor peptide 
activates said regulatory element in the absence 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain. 

In accordance with yet another embodiment of the 
present invention, there are provided methods for the 
treatment of a host in need of gene therapy, said method 
comprising: 

introducing into cells of said host: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein expression of said 
receptor peptide is under the control of an 
inducible promoter, wherein said receptor peptide 
activates said regulatory element in the absence 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain; and 

administering, to said host, ligand for said ligand 
binding domain. 



As used herein, the term "in vivo delivery" refers to 
delivery of biological materials by such routes of 
administration as oral, intravenous, subcutaneous, 
intraperitoneal, intrathecal, intramuscular, intracranial, 
inhalational, topical, transdermal, suppository (rectal), 
pessary (vaginal), and the like. 

As employed herein, the term "ligand" (or ligand 
precursor) refers to a non-steroidal substance or compound 
which, in its native state (or after conversion to its 
"active" form), binds to the receptor peptide, thereby 
creating a ligand/receptor peptide complex, which in turn 
can bind an appropriate response element and activate 
transcription therefrom. Ligands function to modulate 
transcription of nucleic acid(s) maintained under the 
control of a response element. In accordance with one 
aspect of the present invention, unless and until a 
suitable ligand is administered to the host, substantially 
no transcription of the desired exogenous nucleic acids 
occurs. 

Preferred ligands contemplated for use in the practice 
of the present invention are ligands characterized as not 
normally present in the cells of the host to be treated. 
Such ligands are referred to as being exogenous to the 
host. An example of a class of ligands not naturally 
present in mammalian systems is the class of compounds 
referred to as hydrazines, preferably diacyl hydrazines. 

Hydrazines contemplated for use in the present 
invention include compounds which are readily available 
and/or relatively inexpensive to manufacture. One such 
compound, tebufenozide, is a non-steroidal ecdysone agonist 
which is used commercially as an insecticide. This 
compound specifically targets lepidopteran species, 
including Bombyx mori. Tebufenozide has undergone 
extensive testing in animal hosts and has proved to be of 
very low toxicity to mammals and other non-insect species. 

Exemplary hydrazines contemplated for use herein are 
mimics of the naturally occurring ecdysones, i.e., 
synthetic organic compounds which have transactivation 
activity characteristic of the naturally occurring 
ecdysones. Examples of such compounds, referred to herein 
as ecdysone mimics, include 1,2-diacyl hydrazines (e.g., 
tebufenozide and others described in U.S. Patent Nos. 
5,424,333 and 5,354,762, the entire contents of each of 
which are hereby incorporated by reference herein), 
N'-substituted-N,N'-di-substituted hydrazines (e.g., those 
described in U.S. Patent No. 5,117,057, the entire contents 
of which are hereby incorporated by reference herein), 
dibenzoylalkyl cyanohydrazines (e.g., those described in 
European Application No. 461,809, the entire contents of 
which are hereby incorporated by reference herein), 
N-substituted-N-alkyl-N,N'-diaroyl hydrazines (e.g., those 
described in U.S. Patent No. 5,225,443, the entire contents 
of which are hereby incorporated by reference herein), 
N-substituted-N-acyl-N-alkyl, carbonyl hydrazines (e.g., 
those described in European Application No. 234,944, the 
entire contents of which are hereby incorporated by 
reference herein), N-aroyl-N'-alkyl-N'-aroyl hydrazines 
(e.g., those described in U.S. Patent No. 4,985,461, the 
entire contents of which are hereby incorporated by 
reference herein), and the like. 

Since it has been previously reported that the above- 
described diacyl hydrazines are neither toxic nor 
teratogenic, and are not known to affect mammalian 
physiology, they are ideal candidates for use as inducers 
in cultured cells and transgenic mammals according to 
invention methods. 



Ligands are administered in a manner compatible with 
the route of administration, the dosage formulation, and in 
a therapeutically effective amount. The required dosage 
will vary with the particular treatment desired, the degree 
and duration of therapeutic effect desired, the judgment of 
the practitioner, as well as properties peculiar to each 
individual. Moreover, suitable dosage ranges for systemic 
application depend on the route of administration. It is 
anticipated that dosages between about 10 micrograms and 
about 1 milligram per kilogram of body weight per day will 
be used for therapeutic treatment. 
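
The anticipated dosage window stated above scales linearly with 
body weight. The following is a minimal illustrative sketch of that 
arithmetic only; the function name and the 70 kg example are 
assumptions introduced here for illustration and are not part of 
the disclosure, and the bounds are the "about" figures quoted 
above rather than exact limits. 

```python
from typing import Tuple


def daily_dose_range_mg(body_weight_kg: float) -> Tuple[float, float]:
    """Return the (low, high) anticipated daily ligand dose in milligrams.

    Hypothetical helper: applies the range quoted in the text,
    about 10 micrograms to about 1 milligram per kilogram per day.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    low_mg_per_kg = 0.010   # about 10 micrograms per kg per day
    high_mg_per_kg = 1.0    # about 1 milligram per kg per day
    return (low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg)


if __name__ == "__main__":
    # Illustrative 70 kg host (an assumed value, not from the source).
    low, high = daily_dose_range_mg(70.0)
    print(f"anticipated daily dose: {low:.2f} mg to {high:.2f} mg")
```

The actual dosage remains subject to the factors listed above 
(route of administration, desired effect, and the judgment of the 
practitioner). 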

An effective amount of ligand contemplated for use in 
the practice of the present invention is the amount of 
ligand (e.g., diacyl hydrazine(s)) required to achieve the 
desired level of transcription and/or translation of 
exogenous nucleic acid. A therapeutically effective amount 
is typically an amount of a ligand or ligand precursor 
that, when administered in a physiologically acceptable 
composition, is sufficient to achieve a plasma 
concentration of the transcribed or expressed nucleic acid 
product from about 0.1 μg/ml to about 100 μg/ml, preferably 
from about 1.0 μg/ml to about 50 μg/ml, more preferably at 
least about 2 μg/ml and usually 5 to 10 μg/ml. 

Ligand can be administered in a variety of ways, as 
are well-known in the art, i.e., by any means that produces 
contact between ligand and receptor peptide. For example, 
such ligands can be administered topically, orally, 
intravenously, intraperitoneally, intravascularly, and the 
like. The administration can be by any conventional means 
available for use in conjunction with pharmaceuticals, 
e.g., by intravenous injection; either as individual 
therapeutically active ingredients or in a combination with 
other therapeutically active ingredients. Ligands 
contemplated for use in the practice of the present 
invention can be administered alone, but are generally 
administered with a pharmaceutical carrier selected on the 
basis of the chosen route of administration and standard 
pharmaceutical practice. 

Therapeutic compositions containing suitable ligand 
are preferably administered intravenously, as by injection 
of a unit dose, for example. The term "unit dose," when 
used in reference to a therapeutic composition of the 
present invention, refers to a quantity of ligand suitable 
as unitary dosage for the subject, each unit containing a 
predetermined quantity of active material calculated to 
produce the desired therapeutic effect in association with 
the required diluent, i.e., carrier, or vehicle. It may be 
particularly advantageous to administer such compounds in 
depot or long-lasting form as discussed hereinafter. 

Suitable regimes for initial administration and 
booster shots are variable, but are typified by an initial 
administration followed by repeated doses at one or more 
intervals by a subsequent injection or other 
administration. Alternatively, continuous intravenous 
infusion sufficient to maintain concentrations in the blood 
in the ranges specified for in vivo therapies is 
contemplated. 

In accordance with still another embodiment of the 
present invention, there are provided methods for the 
treatment of a host in need of gene therapy, said method 
comprising: 

introducing into cells obtained from said host: 

(i) a nucleic acid construct comprising a promoter 
and exogenous nucleic acid(s) under the control 
of a regulatory element; and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in the 
cells of said host, wherein said receptor peptide 
activates said regulatory element in the absence 
of an exogenous dimer partner therefor and in the 
presence of a ligand for said ligand binding 
domain; and 

to provide modified cells; 

reintroducing the modified cells into said host; and 

administering, to said host, ligand for said ligand 
binding domain. 

In accordance with this embodiment of the present 
invention, the exogenous nucleic acid is introduced 
directly into cells obtained from a donor (host or separate 
donor). Such cells are then implanted within the host. In 
a presently preferred embodiment, the transplanted cells 
are autologous with respect to the host. Autologous means 
that the donor and recipient of the cells are one and the 
same. 

The concept of gene replacement therapy for humans 
involves the introduction of functionally active "wild 
type" or "therapeutic" nucleic acids into the somatic cells 
of an affected host to correct a gene defect or deficiency. 
However, in order for gene replacement therapy to be 
effective, it must be possible to control the time and 
location at which gene expression occurs. 

Genes that encode useful "gene therapy" proteins that 
are not normally transported outside the cell can be used 
in the invention if such genes are "functionally appended" 
to a signal sequence that can "transport" the encoded 
product across the cell membrane. A variety of such signal 
sequences are known and can be used by those skilled in the 
art without undue experimentation. 

The invention will now be described in greater detail 
by reference to the following non-limiting examples. 

Example 1 

Construction of Lepidopteran-derived fusion receptor 
peptide 

The VP16 τ-domain (activation domain) is fused in 
frame onto Bombyx mori-derived nuclear receptor (bR) at an 
internal MluI site near the N-terminus (VbR, Figure 1A). 
An AscI site immediately downstream of the ATG start codon 
allows multimerization of VP16 domains for VbRs with 
multiple VP16 activation domains. Note that unlike 
Drosophila ecdysone receptor (dEcR), bR has virtually no 
C-terminal domain downstream of the LBD. Several amino 
acids of the hormone binding domain are replaced with 
sequences from the retinoic acid receptor or the thyroid 
hormone receptor (VbRRA and VbRTR, respectively) (Figure 
1B). These mutant variations have properties slightly 
different from VbR, and in addition have silent mutations 
that eliminate BclI and BstBI sites in VbR. These are 
useful in the construction of many VbR-based retroviral 
constructs. 

Example 2 

Construction of multifunctional regulatory proteins 

The small size and simplicity of the VbR system lends 
itself to the development of multifunctional regulatory 
proteins which are functional on both hormone receptor and 
TTA responsive promoters. VbR fusion to the tetracycline 
transactivator results in a hybrid protein called TTMT (for 
Tebufenozide/Tetracycline Modulated Transactivator; see 
Figure 1C). This protein functions jointly as a 
ligand-modulated regulator of gene expression from both 
tetO and EcRE (ecdysone response element)-containing 
promoters either separately or simultaneously. This 
compact protein, encoded in approximately 3 kb, confers 
constitutive activation of promoters containing 
tetracycline operators (tetOs) in the absence of 
tetracycline/analogs. In the presence of tet analogs, the 
tetO-binding function of TTMT is blocked, deactivating 
responsive promoters. The VbR half of the chimeric protein 
is constitutively inactive on EcRE-containing promoters, 
but may be stimulated to transactivate responsive promoters 
to a high level by the addition of muristerone A (murA) or 
tebufenozide (teb). Through such a dual system, two 
separate responsive promoters may be simultaneously 
regulated by two separate ligands through a single protein. 
Additional variants using the reverse-Tet-transactivator, 
other VbR variants, novel response elements, and other 
regulatory proteins, will presumably result in further 
customized variations on this same theme. 

A single transgenic animal generated with the TTMT 
protein would be simultaneously responsive to regulation 
of two separate transgenes by two separate 
ligands. Further, individual promoters can be produced 
that respond to both halves of the protein, allowing the 
transgene to be regulated by both ligands, at the 
discretion of the investigator. 
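
The dual regulation described in this Example can be summarized 
as a simple two-input truth table. The sketch below is only a 
qualitative, idealized model of the behavior stated above (tetO 
promoters active unless a tet analog is present; EcRE promoters 
active only when an ecdysone agonist such as murA or teb is 
present). The names PromoterState and ttmt_promoter_state are 
hypothetical and introduced here for illustration; real 
expression levels are graded rather than on/off. 

```python
from typing import NamedTuple


class PromoterState(NamedTuple):
    """Idealized on/off state of the two TTMT-responsive promoters."""
    tetO_active: bool
    EcRE_active: bool


def ttmt_promoter_state(tet_analog_present: bool,
                        ecdysone_agonist_present: bool) -> PromoterState:
    """Qualitative model of TTMT as described in the text.

    - The TTA half activates tetO promoters constitutively and is
      blocked by tetracycline analogs (e.g., doxycycline).
    - The VbR half is inactive on EcRE promoters until an agonist
      (muristerone A or tebufenozide) is added.
    """
    return PromoterState(
        tetO_active=not tet_analog_present,
        EcRE_active=ecdysone_agonist_present,
    )


if __name__ == "__main__":
    # Enumerate all four ligand combinations.
    for tet in (False, True):
        for agonist in (False, True):
            print(tet, agonist, ttmt_promoter_state(tet, agonist))
```

Under this idealization each ligand controls one promoter 
independently, which is the sense in which two transgenes can be 
regulated through a single protein. 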

Example 3 

Transient transfection assay 

Transient transfection experiments are carried out in 
CV-1 cells or 293 cells using modified Drosophila ecdysone 
receptor (VdEcR) or modified Bombyx hormone receptor (VbR) 
with or without dimer partners. Luc activity is examined 
40 hours after transfection and stimulation with ligand 
(Figure 2A). Both receptors are assayed in CV-1 cells with 
1 μM MurA and no exogenous dimer partner. Note that bR 
responds, while dEcR does not. Both receptors are also 
assayed in the presence of 5 μg/ml tebufenozide (Figure 
2B). Note that VbR responds with greater than 200-fold 
induction while Drosophila EcR does not respond at all, and 
in fact, decreases slightly. Both receptors are also 
assayed with and without dimer partners. VbR is stimulated 
with 5 μg/ml tebufenozide and VdEcR is stimulated with 1 μM 
MurA. Note that RXR and Usp are necessary for VdEcR while 
VbR is constitutively activated by the presence of 
exogenous dimer partner and is inhibited by ligand 
addition. VbR, with no exogenous dimer partner, has an 
extremely low base-line which is stimulated 200-300 fold by 
ligand. Note also from this experiment that VdEcR tends to 
have a much higher baseline than VbR without dimer partner. 



A side-by-side comparison of 2.5 µg/ml teb and 1 µM
MurA on VbR shows both ligands are effective at stimulating
VbR. Tebufenozide is administered to cells co-transfected
with nucleic acids encoding receptors: LINX (which encodes
the tet-transactivator (TTA)), VbR, TTMT (which encodes the
TTA-VbR fusion protein), and TTMT-2V (which is like TTMT,
but with two VP16 t-domains) to assay for EcRE-Luc activity
(Figure 2C). Note that teb stimulates VbR and the TTMT
fusions, but has no action on TTA (LINX). The same
experiment as that described for 2C, only with a TetO-Luc
reporter, which responds only to TTA or fusion proteins, is
summarized in Figure 2D. Doxycycline (dox) acts to block
constitutive activity of TTA. Note that in this
experiment, TTMT and TTMT-2V work better than TTA only
(LINX) to stimulate the TetO-luc promoter/reporter. They
are also efficiently blocked by dox. An experiment on CV-1
cells transfected with TTMT and two reporters (6TetO-luc
and 4-EcRE-β-galactosidase) shows both promoters could be
simultaneously influenced by the presence or absence of
both ligands (Figure 2E).

Example 4

Ligand mediated transgene regulation

Transgene regulation by teb or MurA in stably infected
rat fibroblasts is assayed utilizing recombinant
retroviral constructs as the reporter, and VbR encoding
retroviral vectors to provide the receptor. Bulk infected
selected populations stimulated with 2.5 µg/ml teb or 1 µM
MurA indicate β-gal or tyrosine hydroxylase infected cells.
Quantitative analysis of an individual VbR-luc clonal
fibroblast line and a VbR-GH clonal line shows that 0.25
µg/ml teb is as effective at stimulating full activity as
higher levels (Figure 3).

Example 5 

Single-plasmid retroviral constructs

The plasmid, pBO, utilizes a responsive internal
promoter to regulate transgene expression and also contains
a VbR-responsive 3' LTR to autoregulate the VbR receptor
itself (Figure 4). The plasmid, pNA, is constructed using
a 3'-responsive LTR to regulate transgene expression. pBO
utilizes an EcR responsive internal promoter and a TTA
responsive 3' LTR. Both are subject to regulation by the
TTMT protein, simultaneously. The purpose of this design is
to provide chronic stimulation of the duplicated 5' LTR
along with regulated expression of the internal EcR
responsive promoter. By removing all of the native promoting
sequences from the 6To LTR with the exception of the
TetO's, variants of this construct are also inactivatable.

Example 6 

Construction of plasmid vectors

VbR and TTMT systems including constitutive and
inducible promoters are integrated into a modular plasmid
vector known as "pBW" (Figure 5). The purpose of pBW is to
integrate all of the elements of Bombyx receptor- or
tetracycline-analog responsiveness into a single plasmid
vector destined for use in producing transgenic animals by
methods such as pronuclear injection. pBW is designed to
facilitate construction of transgenic animals or stably
transfected cells using VbR variants. The pBW design
simplifies the insertion of responsive promoters,
transgenes, constitutive promoters, and the various VbRs,
TTMTs, and related regulatory proteins. With pBW, the
transgene regulatory properties of generated animals should
be more predictable and reproducible from founder to
founder than with multi-vector systems.

There are two major components of this system: the
first is a tissue-specific expression cassette to produce
the receptor and/or antibiotic resistance genes. The
second is an expression cassette with a responsive promoter
and a transgene. Polyadenylation signals flank all of the
expression cassettes to provide efficient polyadenylation
(p(A)) of transcribed RNAs. The use of rare restriction
endonuclease sites within strategic locations and
polylinkers of pBW will ease cloning of transgenes. An
adjunct shuttle vector derived from pBSK (Stratagene)
called SKSP contains rare sites flanking the polylinker and
compatible with pBW. pBW also shares a number of sites
with pBO to further simplify construction. pBW is
constructed within the plasmid pcDNA3 (Invitrogen).



Example 7 
Regulating transgene
expression in target cells in vitro

Individual desired elements can be inserted
sequentially into a recombinant retrovirus. Recombinant
retrovirus starts as a polylinker composed of 5' NotI-MluI-
NruI-EcoRI-AscI-PmlI-BstEI-BamHI-HindIII-HpaI-ClaI-NsiI-
KpnI 3' sites. The retroviral W gin (see Adam and Miller
in J. Virol. 62:3802 (1988) and Barklis et al., in Cell
47:391 (1986)) is inserted into sites of the recombinant
retroviral polylinker. The internal CMV promoter is
inserted BamHI-ClaI into the BamHI-ClaI sites of the
evolving recombinant retroviral vector. All LTRs destined
for insertion into the 5' location are produced by
low-cycle/high-fidelity PCR production and end primers with
compatible NotI-MluI sites for insertion into the NotI-MluI
sites. 3' LTRs are generated with ClaI-NsiI compatible
ends for insertion into these sites. The R region of the
3' LTR is included in the downstream primer sequence.
Transgenes, receptor cDNAs, or selectable marker genes are
inserted into the remaining polylinker from the EcoRI site
through the BamHI site, or the remaining polylinker after
the internal promoters (HindIII-ClaI).

The resulting vectors are characterized by restriction
digests and mapping, amplified by large-scale plasmid
preparation, and prepared for transfection into packaging
cells by standard methods. Transient retroviral production
in 293 and 293T cells has been previously described (see
Pear et al., in PNAS 90:8392 (1993)). Forty-eight hours
after transient retroviral production, the conditioned
media is removed, filtered through 0.45 µm filters, and
frozen at -70°C until use. 10-cm dishes of primary-cultured
Fischer rat abdominal fibroblasts (Rosenberg et al., Science
242:1575 (1988)) at approximately 50% density are infected
with 1/10 volume of virus-containing media and 8 µg
polybrene/ml for 48 hours. Infected cells are then
selected under the appropriate antibiotic until
non-infected cells are cleared from the population
(approximately 10 days). Resistant colonies are
trypsinized, pooled and passaged. The resulting population
is then infected with the second virus, and reselected
in media containing both antibiotics. The infection
and selection process for the second round of viral
infection proceeds as described for the first round. The
final population of doubly resistant primary fibroblasts
is then pooled and passaged to 6- or 24-well Costar plates
for assay in vitro.

The short (24 hour) course is performed in 24-well
Costar dishes with cells at approximately 80% confluency.
5 µl of ligand (final concentration of 1 µM) or vehicle
(20% EtOH) is added to the 1 ml of media in each well.
Eight hours after first ligand administration, the cells
are all washed in PBS and the media is replaced. The
β-galactosidase histochemical reaction is performed on 1.5%
glutaraldehyde-fixed cells essentially as described by
Shimohama et al. in Brain Res Mol Brain Res 5:271 (1989) in
a 37°C environment for 2 hours. The 4-day time course is
performed in an identical fashion, except that the cell
population is plated at only 20% confluency at the start of
the experiments (and grows to near-total confluency by the
end of the 4 day period) and neither ligand nor media is
changed or replaced.
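
The ligand dosing described above (5 µl of ligand added to 1 ml of
media for a final concentration of 1 µM) implies a stock of roughly
200 µM. The following is a minimal sketch of that arithmetic,
assuming simple volumetric mixing (C1 x V1 = C2 x V2); the function
name is illustrative only and is not part of the described protocol.

```python
def stock_concentration_um(final_um, added_ul, medium_ul):
    """Stock concentration (µM) needed so that adding `added_ul` of stock
    to `medium_ul` of medium yields `final_um` in the combined volume."""
    total_ul = added_ul + medium_ul  # volume after the ligand is added
    return final_um * total_ul / added_ul


# Values taken from the text: 5 µl into 1 ml, 1 µM final.
stock = stock_concentration_um(final_um=1.0, added_ul=5.0, medium_ul=1000.0)
print(f"required stock: {stock:.0f} µM")
```

The same relationship can be used to estimate the final vehicle
concentration carried into each well.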

For long-term culture experiments, MMBG fibroblasts
are plated in triplicate at high density in 24-well plates.
Within 48-72 hours of plating, when the cells reach 100%
confluency within the wells, 1 µM ligand or vehicle is
added to one group of plates, while others are left
completely untreated. The wells are then placed in a 37°C,
10% CO2 environment for the next 25 days without any media
changes. On day 23, the plates that had received neither
vehicle nor ligand on day 1 are removed and half of the
wells are given vehicle and the other half are given 1 µM
ligand for 40 hours.
After 25 days of nutrient deprivation and contact
inhibition, all of the plates are processed for β-gal
histochemical staining as above.

MMGH fibroblasts are produced by infecting a
fibroblast line with the recombinant retroviral vectors
containing desired elements. Cells are selected under both
antibiotics as described previously. After preliminary
examination of ligand-induced hGH production in the bulk
population, the MMGH population is plated at high density
in 6-well Costar plates. Forty-eight hours after the
initial plating and when the cells are essentially
completely growth inhibited by contact, the medium of all
plates is replaced with DME containing 2% FBS. Under these
conditions, primary rat fibroblasts stop dividing even if
they are not confluent, and settle out into a distended
morphology with prominent nuclei characteristic of severely
growth-arrested fibroblasts. Six days after
acclimatization to this culture environment, 1 ml of medium
is harvested from each well and plate of the experiment and
stored frozen at -20°C. The remaining unused media is
discarded at the same time of harvest and replaced with
fresh DME-2% FBS. Groups of wells are treated or not with
1 µM ligand. Hormonally treated wells are washed with
medium to remove as much residual ligand as possible prior
to medium replacement when ligand treatment is
discontinued. This routine is repeated daily for the
21-day extent of the experiment.

After all time-point samples have been collected and
frozen at -20°C, they are simultaneously processed for the
presence of the hGH protein by ELISA (Boehringer Mannheim)
following the protocol recommended by the manufacturer.
ELISA data are quantitated on an MR700 microplate reader
(Dynatech Laboratories, Chantilly, VA) and compared to a
standard curve generated using purified hGH to determine
picogram GH amounts. Control wells are TH (tyrosine
hydroxylase) producing fibroblasts, but display all of the
characteristics of the recombinant retrovirus except for
the hGH transgene and secreted protein.

Example 8

Expression Vectors and Transient Transfection Assays

All receptors were subcloned into vector LNCX (A.D.
Miller, GenBank accession no. M28247, with an extended
polylinker) for use in transient transfection assays. VBR
(or CVBR when referring to the retrovirus) was produced by
insertion of VP16 (14) sequences with a synthetic ATG start
codon into the amino-terminal region of BE up to, and
in-frame with, the MluI site (corresponding to amino acid
26). VP16 primers: (5') 5'-GAGAGAAGCTTATGGCGCGCCCGACCGATG
(SEQ ID NO:5) and (3') 5'-CACACACGCGTGTACTCGTCAATTCCAAG
(SEQ ID NO:6). VDE (or CVDE) was produced in a similar
fashion by fusing VP16 sequences in-frame at a novel NcoI
site corresponding to amino acid 68. The downstream primer
was made NcoI compatible. Plasmid template (100 ng), each
primer (500 ng), and reaction conditions outlined by the
manufacturer for Pwo (Boehringer Mannheim) high fidelity
polymerase were used with a program of 1 min at 94°C,
1 min at 45°C, and 1 min at 72°C for 20 cycles for
production of all PCR products used herein. LNCX-Usp and
RXR were produced from EcoRI fragments encoding the
complete cDNAs, filling with the Klenow fragment of DNA
polymerase, and inserting the fragments into the HpaI site
of LNCX. Orientation was determined by restriction
endonuclease digestion. Transient transfection in CV-1 and
293 cells was performed by using standard methods (Sambrook
et al. (1989) Molecular Cloning: A Laboratory Manual, 2nd
edition) in triplicate in 24-well plates by calcium
phosphate coprecipitation of 100 ng receptor(s), reporter
plasmid E4-luc, and pCH110 as an internal control.
Briefly, E4-luc is four tandem EcREs inserted upstream of
a thymidine kinase gene minimal promoter directing
luciferase expression. EcRE oligonucleotides were as
described (Thomas et al. (1993) Nature 362:471-475) with
BamHI-BglII compatible ends. Ligand (1 µM) was added at
the time of transfection, and 40 h later cells were lysed
and extracts were used for β-galactosidase assay and
measurement of luciferase activity in an analytical
bioluminescence photometer.
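
As an illustration of how readings from such an assay are commonly
reduced to a fold induction, the sketch below divides each luciferase
reading by the β-galactosidase internal control from the same well and
then compares ligand-treated with vehicle-treated replicates. The
function names and the numeric readings are hypothetical and are not
data from the experiments described herein; the exact normalization
used is an assumption.

```python
from statistics import mean


def normalized_activity(luciferase, beta_gal):
    """Luciferase reading corrected for transfection efficiency using the
    β-galactosidase internal control measured in the same well."""
    return luciferase / beta_gal


def fold_induction(ligand_wells, vehicle_wells):
    """Mean normalized activity with ligand divided by mean normalized
    activity with vehicle. Each argument is a list of
    (luciferase, beta_gal) pairs, one pair per replicate well."""
    treated = mean(normalized_activity(l, b) for l, b in ligand_wells)
    basal = mean(normalized_activity(l, b) for l, b in vehicle_wells)
    return treated / basal


# Hypothetical triplicate readings, for illustration only.
vehicle = [(100.0, 1.0), (120.0, 1.2), (90.0, 0.9)]
ligand = [(20000.0, 1.0), (22000.0, 1.1), (18000.0, 0.9)]
print(f"fold induction: {fold_induction(ligand, vehicle):.1f}")
```

Because the ratio depends on the vehicle-treated baseline, a receptor
with a very low basal activity can show a large relative induction
even when its absolute ligand-stimulated activity is modest.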

CVDEiR and MS vectors used only as retroviruses in
production of stable cell lines were produced as follows:
The VDEiR expression cassette was constructed by modifying
the ATG start codon of human RXRα to contain an overlapping
BstXI site for fusion into the ATG start codon of the
0.8-kb EMCV IRES (Jang et al. (1989) J. Virol. 63:1651-1660
and Hoshimaru et al. (1996) PNAS 93:1518-1523) sequence.
The IRES-RXR cassette was then subcloned downstream of VDE
in LNCX-ofVDE (CofVDE), a VDE variant vector with lower
basal transactivation levels in superphysiological RXR
environments, to produce CVDEiR. The MS retroviral
reporter was produced by removing the Moloney murine
leukemia virus enhancer core from the 3' long terminal
repeat (LTR) with NheI-XbaI digestion and replacing this
sequence with four tandem EcREs, resulting in a reporter
analogous to the E4-luc reporter plasmid.

Example 9 
Gel Mobility Shift Assays 

Gel mobility shift analysis was performed by in vitro
translation of receptor ORFs subcloned into pBSK
(Stratagene), pGEM-3 (Promega), and pSL301 (Invitrogen) in
the presence of [35S]Met by using T3/T7 TNT (Promega)
transcription/translation and following the manufacturer's
protocol. In vitro translated proteins were qualitatively
examined by 5% SDS/PAGE with protocols as described
elsewhere (Sambrook et al. (1989) Molecular Cloning: A
Laboratory Manual, 2nd edition) and quantified by
PhosphorImager (Molecular Dynamics) exposures of dried
gels. Amounts of proteins used in gel shift assays were
normalized by using quantitative data and correction for
predicted Met residues in individual constructs.
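
The correction for Met residues follows from the fact that each
[35S]Met incorporated contributes to the band signal, so a construct
with more Met residues gives more signal per molecule. A minimal
sketch of one way such a normalization could be computed is given
below; the construct names, signals, and Met counts are hypothetical,
and the choice to scale every sample down to the least abundant one
is an assumption rather than the procedure actually used.

```python
def relative_molar_amount(band_signal, met_count):
    """Band signal per Met residue, proportional to moles of protein."""
    return band_signal / met_count


def volumes_for_equal_protein(signals, met_counts, reference_ul=1.0):
    """Volume of each translation reaction that delivers the same molar
    amount of protein as `reference_ul` of the least abundant sample."""
    amounts = {name: relative_molar_amount(signals[name], met_counts[name])
               for name in signals}
    target = min(amounts.values())  # least abundant construct sets the level
    return {name: reference_ul * target / amount
            for name, amount in amounts.items()}


# Hypothetical PhosphorImager volumes and predicted Met counts.
signals = {"A": 1000.0, "B": 1500.0}
met_counts = {"A": 10, "B": 30}
print(volumes_for_equal_protein(signals, met_counts))
```

In this made-up case construct B has the stronger band but, once
corrected for its higher Met content, is the less abundant protein.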

Double-stranded EcRE probes corresponding to response
elements described above were labeled by filling in with
the Klenow fragment of DNA polymerase with [32P]dCTP and
unlabeled dGAT by standard methods. Reaction conditions
for protein-probe interaction and gel electrophoresis were
essentially as described by Yao et al. ((1992) Cell
71:63-72) except, to facilitate comparison between samples,
reaction mixtures (including dimer partners and probe)
were prepared as a mixture and distributed equally to
individual tubes with receptor proteins. The reactions
were allowed to proceed at 23°C for 5 min, at which time
ligand or vehicle was added and the reaction allowed to
continue for an additional 20 min.



Example 10 

Construction of Bombyx mori receptor (BE) - Drosophila
melanogaster ecdysone receptor (DE) Chimeric Receptors

Chimeric receptors were produced by taking advantage
of conserved DNA sequences and sites within the receptors
and by introducing compatible sites by low cycle, high
fidelity PCR (described above) of Bombyx mori receptor
(BE) and Drosophila melanogaster ecdysone receptor (DE)
templates to produce fusion proteins. DEBE-A/B was
produced by PCR and replaced the BE A/B domain (amino acids
1-197, as set forth in SEQ ID NO:3) with amino acids 1-255
of DE, introducing a novel ApaI restriction site. DEBE-C
replaced the A/B and C domains (amino acids 1-273, as set
forth in SEQ ID NO:3) of BE with amino acids 1-331 of the
DE protein by introduction of a compatible EagI site into
BE at the fusion junction. DEBE-D replaced BE from amino
acids 1-363, as set forth in SEQ ID NO:3, with amino acids
1-430 of DE by KpnI digest. BEDE was the reverse of this
construct and replaced amino acids 1-430 of DE with amino
acids 1-363 (as set forth in SEQ ID NO:3) of BE by using
KpnI partial digest. DEBH was produced from DEBE-C by
removing all BE sequences except the D domain by KpnI
digestion and inserting sequences encoding the intact E and
F domains (amino acids 430-878) of the original DE after
partial KpnI digestion. VEH was produced by excision of
the central region of DEBH, including the heterologous
hinge domain, and insertion into VDE, replacing the
analogous region.

Example 11 

Construction of E Domain Chimeric Receptors 

The chimera, BEDB, was produced by insertion of the
654-bp KpnI-BglII fragment encoding precisely the DE
hormone-binding domain into the identical analogous sites
in BE. Two unique sites within the hormone-binding domain
of BEDB were used in the production of the chimeras. A
unique AatII site lies approximately 120 bp downstream of
the 5' KpnI site, and a unique EagI site lies
approximately 440 bp downstream. PCR was used as described
above, and corresponding fragments of BE with appropriate
compatible ends were generated and subcloned into BEDB
digested with KpnI + AatII (BKE), AatII + EagI (BAE), AatII
+ BglII (BAB), and EagI-BglII for BEB. The resulting
constructs were translated, examined by SDS/PAGE,
quantified, and normalized as described above. Conditions
for gel mobility shift assay were also as described above.

Example 12 

Infection of Cells with Recombinant Retroviruses 

Transient retroviral production in 293 cell types has
been described (Pear et al. (1993) PNAS 90:8392-8396).
Forty-eight hours after transient retroviral production,
the conditioned medium was removed, filtered through
0.45-µm filters, and frozen at -70°C until use.

Ten-centimeter dishes of primary-cultured Fischer rat
abdominal fibroblasts (Schinstine et al. (1992) Neurochem
58:2019-2029) and 293 and CV-1 cells at approximately 50%
density were infected at a multiplicity of infection of
approximately 0.05 with virus-containing media and 8 µg/ml
Polybrene for 48 h. Infected cells were then selected with
4-6 mM L-histidinol until noninfected cells were cleared
from the population: approximately 10 d for FF12s, 21
d for 293s, and >30 d for CV-1s (CV-1 cell populations were
resistant to rigorous selection with L-histidinol).
Resistant colonies from multiple plates were trypsinized,
pooled, and passaged. The resulting bulk population was
then split and infected with receptor-bearing virus and
reselected in medium containing both L-histidinol and
400-800 µg/ml G418. The infection and selection process
for the second round of viral infection proceeded as
described for the first round. The final population of
doubly resistant primary fibroblasts was then pooled and
passaged in triplicate in 6- or 24-well Costar plates for
β-galactosidase staining by standard methods or luciferase
assay.
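
The low multiplicity of infection used above (approximately 0.05) can
be related to the expected proportion of infected cells and to the
likelihood that an infected cell carries a single provirus. The sketch
below assumes that infection events per cell follow a Poisson
distribution, a common simplification that is not stated in the
procedure itself; the function names are illustrative.

```python
import math


def fraction_infected(moi):
    """Expected fraction of cells receiving at least one infection event,
    assuming events per cell are Poisson distributed with mean `moi`."""
    return 1.0 - math.exp(-moi)


def fraction_single_hit_among_infected(moi):
    """Among infected cells, the fraction expected to have received
    exactly one infection event under the same Poisson assumption."""
    exactly_one = moi * math.exp(-moi)
    return exactly_one / fraction_infected(moi)


moi = 0.05  # value taken from the text
print(f"infected before selection: {fraction_infected(moi):.1%}")
print(f"single-hit among infected: {fraction_single_hit_among_infected(moi):.1%}")
```

Under this assumption only a small minority of cells is infected
before selection, and nearly all of those that are infected carry a
single copy, which is consistent with the stated aim of testing
function at low or single copy.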
Example 13

CV-1 Cells Support BE-Mediated Transactivation Without
Exogenous Dimer Partner

Both receptors were tested to determine the effect
that differential endogenous dimer partner availability had
on transactivation of responsive promoters. Relative
luciferase activity assays employing the reporter plasmid
E4-luc were performed with VDE and VBR transiently
cotransfected into 293 or CV-1 cells with either Usp, RXR,
or no added heterodimer partner. Luciferase activity was
assayed in the presence of either vehicle, 1 µM murA, or
1 µM tebufenozide.

In the presence of an equimolar exogenously added
dimer partner, VDE and VBR transactivation was similar in
both cell types. With Usp, both proteins were induced less
than 2-fold by ligand and displayed a high level of basal
transactivation. With RXR, VDE displayed an average
relative 5-25-fold induction across both cell types,
whereas VBR was induced only 2.35-fold, even though the
absolute level of induction matched or exceeded the
expression with VDE. The decreased relative induction of
VBR + RXR resulted from approximately doubled basal
activity levels compared with VDE + RXR. With no exogenous
dimer partner, both receptors in both cell types exhibited
dramatic 15- to 80-fold decreases in basal transactivation.



With no exogenous dimer partner, VDE failed to respond
to tebufenozide, whereas VBR continued to respond well to
both murA and tebufenozide. The addition of RXR to
VDE-transfected 293 cells increased the maximum
murA-treated expression level by only 20%, indicating that
the high level of endogenous RXR in 293 cells is near
saturation for heterodimerization with DE. By comparison,
CV-1 cells supported murA stimulation of VDE at only 13% of
the expression level of VDE with added RXR (Fig. is). VBR
was active at levels similar to VDE in the high RXR
background of the 293 cells; however, in the CV-1 cells,
tebufenozide-stimulated VBR exhibited both the highest
absolute level of transactivation and the greatest relative
induction (160.2-fold) of any of the combinations of
receptors and ligands tested. In the CV-1 cells,
tebufenozide-stimulated VBR displayed 21-fold greater
relative induction and an absolute expression level 9.25
times the level of VDE treated with murA.

Example 14 

BE Displays Facilitated Heterodimer Formation with Usp
and RXR

The high level function of VBR in the low heterodimer
partner environment of the CV-1 cells suggested that BE
might be capable of efficient function with lower levels of
dimer partner than DE. To explore this possibility,
cell-free experiments with gel mobility shift analysis and
equivalent amounts of in vitro translated BE and DE in
combination with Usp or RXR dimer partners and both ligands
were performed. Neither protein alone formed a dimeric
complex with the EcRE probe unless either the Usp or RXR
dimer partner proteins were added. In the presence of Usp,
DE formed a comparatively weak interaction with the EcREs
that was increased approximately 4-fold by murA and 2-fold
by tebufenozide. In contrast, BE produced a strong shift
with Usp that was only slightly influenced by either
ligand. In the absence of any ligand, BE bound probe 5
times more efficiently than DE. With ligand, the BE probe
shift was observed to still exceed the DE shift and was
double the DE + Usp band volume. Both receptors combined
with RXR displayed an absolute dependence on ligand for
heterodimer formation; however, whereas BE + RXR with
either ligand displayed a prominent shifted band, an
equivalent amount of DE resulted in a barely detectable
shifted band with murA and no detectable band shift with
tebufenozide. The BE + RXR tebufenozide- and murA-induced
shifts were determined to be >15 times the DE + RXR shift
with murA.

Example 15 

BE-DE Chimeras Reveal That the D Domain Is Critical to
High Affinity Heterodimerization

Gel mobility shift assays indicated that BE had higher
affinity heterodimer formation than DE. To further define
the subdomains and molecular determinants resulting in the
BE high affinity phenotype, PCR mutagenesis and internal
shared restriction endonuclease sites were used to produce
BE-DE chimeric receptors that were assayed for their
ability to bind the EcRE probe with either Usp or RXR and
with or without ligand. Because previous results indicated
that murA stimulated both EcRs, for purposes of direct
comparison only murA was used for gel shift analysis of the
chimeras. Fusion constructs began at the N terminus of BE
and sequentially replaced the A/B (DEBE-A), C (DEBE-C),
and D (DEBE-D) domains with DE sequences. BEDE was the
reverse, encoding BE up to the D-E domain boundary with the
DE E and F domains. DEBE-A and DEBE-C with Usp shifted
nearly as efficiently as native BE. The absence of shift
for DEBE-D, however, and the substantial shift seen for
BEDE + Usp suggested that the high affinity determinant for
BE-Usp heterodimerization resided in the D, or hinge,
domain. The same constructs in combination with RXR
revealed a different shift pattern. BE and DEBE-A both
responded strongly to ligand to bind and shift the probe.
DEBE-C with ligand was approximately 40% decreased relative
to native BE; however, it still displayed 16-fold greater
binding than native DE. DEBE-D had lost all high affinity
DNA-binding complex formation and functioned
indistinguishably from the native DE with or without
ligand. MurA-stimulated BEDE + RXR produced a band shift
4 times the intensity of the native DE + RXR stimulated
with murA, but BEDE was clearly significantly impaired in
heterodimerization with RXR compared with native BE and
other higher affinity chimeras like DEBE-A and DEBE-C.

For further confirmation of the role of the BE D
domain in dimer partner affinity, the D region of DE was
replaced with the BE D domain to produce DEBH. Native DE
and DEBH were compared side by side for binding to EcREs
with both Usp and RXR dimer partners. Gel mobility shift
of in vitro translated and normalized DE and
hinge-substituted DEBH was performed in the presence of
either Usp or RXR, and further in the presence of vehicle
or 1 µM murA. Compared with DE + Usp, DEBH + Usp averaged
5-fold greater probe binding with murA and 9-fold greater
binding without ligand. DEBH + RXR exhibited both
decreased probe shift in the absence of ligand and
increased probe binding with murA (approximately 3-fold
over native DE) for a 10-fold relative induction by ligand
compared with 2.5-fold for DE. Transient transfection
analysis of a VP16-DEBH construct (VEH) in CV-1 cells
revealed that VEH shared characteristics of both DE and BE.
VEH displayed the high basal transactivation level and
low relative induction (average of 2.5 fold) characteristic
of BE with superphysiological RXR. VEH displayed 5- to
8-fold greater basal and murA-activated expression than
native DE in the absence of any exogenous dimer partner.
MurA-stimulated VEH expression exceeded even the expression
level for murA-stimulated BE by nearly 2-fold. As
predicted, without the BE hormone-binding domain, DEBH did
not respond to tebufenozide. Additional chimeric
proteins with a subset of BE D-region sequences did not
function as well as DEBH with replacement of the entire D
domain, suggesting that multiple discrete determinants
presumably over much of the hinge region contribute to the
high affinity phenotype.



Example 16

Discrete Functional Regions Within the E Domain Revealed
by BE-DE Chimeras

Although the preceding data reveal that the BE D domain
alone was sufficient for high affinity heterodimerization
with Usp, a comparison of the relative shift of the DEBE-D
and BEDE chimeras with both partners suggested that
additional determinants within the BE hormone-binding E
domain contributed to a high affinity interaction with RXR.
To further study the BE E domain, chimeric proteins were
produced by using BEDE as a template. Five E-domain
chimeras named BEDB, BKE, BAE, BAB, and BEB were prepared.
BEDB is identical with BE with the exception of complete
replacement of the E domain with corresponding sequences
from DE; BEDB was identical with BEDE with the exception of
replacement of the large C-terminal DE F domain with the
20-amino acid F domain of BE. BKE, BAE, BAB, and BEB are
derived from BEDB and are sequential, C-terminal moving
replacements of three regions within the DE-derived E
domain. Gel mobility shift assays with BE, BEDE, and BEDB
revealed that BEDE and BEDB are essentially identical in
their properties of complex formation, suggesting that the
F domain does not play a significant role in
heterodimerization and DNA binding. In addition, BEDB
recapitulates the high affinity Usp binding and the low
affinity RXR binding of BEDE.



The BEDB-derived chimeric receptors reveal several
functional subdomains over most of the BE E domain. By
using unique internal sites, the E domain was subdivided
into approximately thirds called E1, E2, and E3. Treatments
were with vehicle (-), 1 µM murA (M), or 1 µM tebufenozide
(T). Chimera BKE with replacement of the DE E1 and E2
regions and chimera BAE with replacement of only the E2
region displayed very similar patterns of shift. Both
chimeras were significantly impaired in complex formation
with Usp and RXR relative to the original BEDB chimera,
suggesting that there were fundamental incompatibilities
between subdomains of the Bombyx and Drosophila E domains.
The similarity of these two constructs suggested that DE
and BE E1 regions were likely to be functionally similar,
but that the E2 regions had very different properties.
Curiously, although both basal and murA-stimulated
heterodimerization were inhibited, BKE/BAE-Usp
heterodimerization was significantly stimulated by
tebufenozide, indicating that the tebufenozide response
determinant is within E2 but that the high affinity
RXR-binding determinant lies elsewhere.

When E2 was combined in tandem with E3 in the chimeric
receptor BAB, high affinity heterodimer formation
indistinguishable from native BE heterodimerization was
observed with both Usp and RXR. To determine whether the
E3 region alone could confer high affinity
heterodimerization with RXR, the chimera BEB, with
substitution of only the E3 region, was produced and tested.
BEB displayed a pattern of response essentially identical
with the original BEDB with low-level interaction with RXR
and only upon stimulation with murA. BEB, which lacked the
E2 region of native BE, was not responsive to tebufenozide.
Taken together, these results indicate that the BE D domain
and E3 region, in combination with ligand-binding
determinants within the E2 region, function in concert to
produce high affinity, ligand-dependent heterodimerization
between BE and the RXR protein.



Example 17

Comparison of VDE and VBR Function in Stably Transduced
Cells

Building on transient transfection and gel shift data,
DE and BE variants were directly compared for their ability
to function at low or single copy in stably transduced cell
types. In addition to direct comparison of CVDE and CVBR
retroviruses, an RXR-bearing VDE variant called CVDEiR was
also tested that previous studies had determined to
provide the best combination of low basal expression and
relatively high induction for VDE cotransferred with
supplemental RXR. These three receptor-bearing vectors and
an empty LNCX control were used to infect 293, CV-1, and
primary fibroblast cells harboring an EcRE-responsive
retrovirus termed MS. MS encoded a 3' LTR with the core
enhancer elements replaced with four tandem EcREs and an
internal simian virus 40 promoter vector directing
resistance to the drug L-histidinol. Parallel MS vectors
encoding either the LacZ gene (MS-Z) or a luciferase
transgene (MS-luc) were used in this comparative study, the
former as an indication of the number of responding cells
and the latter for quantitative purposes.

Three cell types, 293s, CV-1s, and rat primary
fibroblasts (line FF12), were selected for assay of
reporter/receptor virus combinations based on their
relative capacities to support high level DE-mediated
transactivation. The human 293 cell line supports
essentially full DE activation with no requirement for
added RXR, whereas CV-1s require the cotransfection of an
exogenous dimer partner for DE function. FF12s were
selected as the third cell type because they are refractory
to chemical methods of transduction and reliant on the use
of biological vectors such as recombinant retroviruses for
efficient gene transfer. Furthermore, primary fibroblasts
are frequently used as autologous donor cells for
transplantation approaches to somatic gene therapy.

All three cell types were infected with both the
reporter and receptor-bearing vectors and treated for 72 h
with either vehicle, 1 µM murA or 1 µM tebufenozide.
Histochemical staining of MS-Z infected cells indicated
that all three receptor types were capable of high level
ligand-responsive transactivation in coinfected 293 cells.
DE-derived constructs responded well to murA but were
unable to respond strongly to tebufenozide. CVBR, on the
other hand, responded moderately to murA but strongly to
tebufenozide. MS-LacZ cell types infected with an empty
LNCX vector displayed no staining when stimulated with any
ligand and further confirmed observations from
vehicle-treated cells that the MS vector permitted only
relatively low basal levels of transgene expression.
Approximately 30% of CVDE-, CVDEiR-, and CVBR-infected
MS-Z 293 cells responded to murA. Tebufenozide stimulated
less than 1% of CVDE-infected cells and less than 4% of
CVDEiR-infected MS-Z 293s. More than 50% of CVBR-infected
MS-Z 293s responded to tebufenozide, however, exceeding all
DE-based vectors with murA. More dramatic differences were
seen in CV-1 and FF12 MS-Z infected cells. CVDE alone was
not inducible in either cell type, indicating that the
endogenous level of RXR in either CV-1s or primary
fibroblasts is not sufficient to support VDE-mediated
transactivation. When the endogenous level of RXR was
supplemented, as in CVDEiR, 3-10% of the infected
population was observed to strongly respond to murA. CVBR,
however, was clearly the most potent transactivator in the
CV-1 and FF12 cell types, with 20% of CVBR-infected CV-1
cells and 55% of CVBR-infected FF12 cells histochemically
positive.

Luciferase activity in MS-luc cells infected in
parallel with the same receptor viruses confirmed that CVBR
was the most potent transactivator in the CV-1 and FF12
cell types and that all receptor types were functional in
293 cells. The MS-Z data suggest that the low absolute
level of CVDEiR induction in CV-1 and FF12 cells with murA
most likely reflects a small number of robustly responding
cells in a largely nonresponsive population as opposed to
low level induction by most MS-luc-infected cells.

Example 18

Transient transfections

Transient transfections are performed by calcium-phosphate
coprecipitation employing standard methods (see
Sambrook et al., in Molecular Cloning: A Laboratory Manual,
Cold Spring Harbor Press, New York, New York (1989)). All
tissue culture experiments are performed using DMEM 10% FBS
in a 10% CO2 incubator unless otherwise specified. All
transfections are performed in triplicate in 24-well Costar
plates using CV-1 cells at an approximately BXIO'^ plating
density. Immediately following transfection, 1 µM ligand
in 20% EtOH/PBS is added to wells. After 40 additional
hours of incubation, cells are harvested and luciferase
activity is measured in an analytical bioluminescence
photometer. Cell extracts are simultaneously examined for
LacZ activity of the internal control by standard methods.
Bar graph levels represent relative luciferase activity
after correction using internal control values. All
molecular biology enzymes and reagents used in this study
are provided by either NEB (Beverly, MA) or Stratagene (La
Jolla, CA).
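
The correction described in this example (luciferase readings
divided by the LacZ internal control for the same well, then
averaged over triplicate wells) can be sketched in Python as
follows. This is only an illustrative sketch: the function
names, the data layout and the example readings are
hypothetical and are not taken from the specification.

```python
from statistics import mean


def relative_luciferase(luciferase, lacz):
    """Return per-well luciferase activity corrected by the LacZ internal control."""
    if len(luciferase) != len(lacz):
        raise ValueError("each well needs one luciferase and one LacZ reading")
    if any(z <= 0 for z in lacz):
        raise ValueError("LacZ control readings must be positive")
    return [l / z for l, z in zip(luciferase, lacz)]


def fold_induction(treated, vehicle):
    """Ratio of mean corrected activity with ligand to that with vehicle alone."""
    return mean(treated) / mean(vehicle)


# Hypothetical triplicate readings (arbitrary units), for illustration only.
vehicle = relative_luciferase([100.0, 200.0, 50.0], [1.0, 2.0, 0.5])
ligand = relative_luciferase([2000.0, 4000.0, 1000.0], [1.0, 2.0, 0.5])
print(fold_induction(ligand, vehicle))
```

Dividing each well by its own internal control before averaging
keeps well-to-well differences in transfection efficiency from
being mistaken for ligand-dependent induction.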

Example 19

Prolonged transgene expression for
ex vivo gene therapy applications

Autologous, explanted skin fibroblasts genetically
modified to express tyrosine hydroxylase (TH), the enzyme
responsible for synthesis of L-dopa, the precursor of
the neurotransmitter dopamine, have proven to ameliorate a
loss of local dopamine in animal models of Parkinson's
disease. Although quite effective in providing dopamine to
the area of neural damage, this technique is only
therapeutically useful for 2-3 weeks following
transplantation. The decrease in effectiveness has been
traced back to a dramatic loss of TH transgene expression
in transplanted, post-mitotic cells. The use of invention
constructs allows one to overcome the loss of transgene
expression by providing stimulation of the retroviral LTR
promoter through either ligand-activated transactivating
complexes or through constitutive transactivating receptor
variants. In this way, transgene expression may be
maintained for longer periods of time, even indefinitely if
desired.



Stem-type cells, such as stem cells of the
hematopoietic system, nervous system, or embryo, could be
infected or transfected with VbR regulated transgenes and
subsequently implanted into adult, fetal, or early
embryonic animals for either therapeutic or research
purposes. Cells of the hematopoietic system could
conditionally express proteins producing blood clotting
factors such as factor IX, metabolic factors such as
glucocerebrosidase, or protective factors including
anti-HIV proteins.

Example 20

Prolonged transgene expression for
in vivo gene therapy applications

The small size and regulatory capacity of VbRs lend
themselves to use in recombinant retroviruses as a method
of gene transfer. VbRs have been introduced into both MLV
and lentiviral-based retroviral systems. When these
viruses are introduced directly into target cells of either
a mature or developing organism, expression of the virally
encoded transgenes may be regulated by systemic addition of
ligands such as tebufenozide and derivatives. An example
of a disease that could theoretically be ameliorated by
application of in vivo VbR encoding retroviruses is
Parkinson's disease, described above.

Another application would be to use VbR encoding
retroviruses as an anti-viral agent. Lentiviral vectors
with regulated properties and harboring suicide genes or
"protective" proteins could be used as a means of
conditionally depleting or destroying HIV-positive cells.



Example 21

Modulated transgene expression for either
in vivo or ex vivo gene therapy application



Treatment of Parkinson's disease with the chemical
precursor of dopamine, L-dopa, has proven effective in
ameliorating many of the deficits of Parkinsonism. With
time, however, patients become refractory to L-dopa
therapy, with the deleterious effects of chronic treatment
outweighing even the serious symptoms of the disease
itself. Eventually, patients are left with few therapeutic
options. While the transplantation of TH-expressing cells
may be effective when constantly producing low levels of
L-dopa, a potentially far more beneficial approach would be
to allow the physician some degree of control over L-dopa
production in the patient. This would allow sufficient
control to ensure that the transgenic factor is expressed
at appropriate therapeutic levels. At times when
endogenous systems are capable of providing full function,
the transgene may be allowed to become quiescent and
transcriptionally inactive until needed again. Because the
transcriptional induction of the invention retroviral
constructs is dependent on an exogenous ligand, expression
of an integrated therapeutic transgene can be placed under
the control of the physician and patient.






Example 22 

Construction of a BORIS vector expressing PolyQ-GFP

PolyQ-GFP constructs are produced from an existing GFP
clone, GFP-SKSP. PCR primers encoding a novel restriction
site and ATG start codon at the 5' end, and an Age I site
in-frame with the GFP ORF at the 3' end, are obtained for
PCR amplification of huntingtin polyQ regions. These PCR
products are inserted in-frame into GFP-SKSP to produce the
fusion constructs of pQ-GFP. The clones are analyzed for
mutations and function by sequencing and in vitro
translation to confirm the creation of a pQ-GFP ORF.
Alternate antibiotic resistance BORIS regulatable
retroviral vectors are produced by removing the
L-Histidinol gene from plasmid LSHL by PCR and inserting
it in place of the neo^r gene in a BORIS vector for
production of BORIS-LHis. The plasmids are large-scale
prepped. A preliminary test for function is transient
transfection into transfectable cell types such as 293
cells and examination for the expression of
beta-galactosidase or GFP reporter genes. Retroviruses are
transiently produced and used for infection of target
cells. Infected cells are selected in L-Histidinol
containing media to ensure proper selection. PolyQ-GFP is
then inserted into regulated retroviral vectors. pQ-GFPs
are excised from SKSP and inserted into the regulatable
BORIS and BORIS-LHis vectors. The plasmids are large-scale
prepped. Function is determined by examination for green
fluorescence and altered phenotype. Additional plasmid is
used for transient production of retroviruses.

Primary rat fibroblasts and hcn/v-myc cells are selected
for initial testing of BPQGS. Both are infected with the
appropriate vector(s) and placed under selection, and
individual resulting colonies are subcloned. The subclones
are expanded and passaged for analysis. Infected cells are
treated or not with their respective ligands and the
resulting phenotypes analyzed. Preliminary analysis will
examine GFP fluorescence and distribution within the cells.
At timed intervals following ligand-induced expression, the
pattern of fluorescence is recorded to determine if the
fusion protein changes in distribution. Morphological
characteristics of stimulated cells, their growth rates,
and cell death will also be examined at intervals following
induction. For the hcn/v-myc cells, the consequences of
polyQ-GFP expression in the presence and absence of
tetracycline are compared.

Example 23 

Aggregation of polyQ-fusion proteins

GFP fusion proteins with long (97Q), short (13Q), or
no N-terminal polyQ tracts were constructed. Unique
restriction endonuclease sites were designed into the
regions flanking the polyQ tracts to allow insertion of an
SV40 nuclear localization signal (NLS) or other sequences
C-terminal to the polyQ tract. Transient
transfection analysis of these constructs in the NIT
expression vector revealed that cells transduced with GFP
only or 13Q-GFP expressed GFP uniformly throughout their
cytoplasm and that the NLS variant 13QN-GFP localized
predominantly in the nucleus with a uniform distribution.
97Q- and 97QN-GFP constructs, on the other hand, initially
manifested as uniform GFP positive fluorescence throughout
the cytoplasm or nucleus, but within 24-72 hours after
transfection condensed into bright granules indicative of
intracellular aggregates (IAs). IAs were localized within
the cytoplasm, surrounding the nuclear envelope, and within
the nucleus itself (Figure 1B). The presence of the SV40
NLS, though observed to efficiently direct localization of
either 13Q- or 97Q-GFP variants to the nucleus, resulted in
no quantifiable difference in the rate of IA formation
after transfection.

Example 24

GFP-fusion proteins with short polyglutamine tracts are
recruited into IAs



It was sought to determine whether overexpression of
13Q-GFP constructs would inhibit or interfere with the
formation of IAs from 97Q-fusion constructs. 13Q-GFP was
not observed to exert any inhibitory influence on 97Q-GFP
mediated IA formation and, contrary to expectations,
appeared to result in a qualitative difference in the IAs
that formed. To test whether the 13Q-GFP fusion proteins
were participating in the formation of IAs, the
GFP-encoding portion of 97Q-GFP was deleted (97QΔ) and this
construct was cotransfected with 13Q-GFP into cultured
293 cells. Fluorescent IAs were readily observed in a
large population of the GFP-positive cells within the 24-72
hr time course observed for 97Q-GFP alone. 13Q-GFP/97QΔ
doubly infected cells were distinctive from 97Q-GFP cells
because they continued to display faint uniform cytoplasmic
GFP positivity in addition to the formation of
predominantly nuclear IAs, whereas IA formation within
97Q-GFP cells generally results in condensation of all GFP
fluorescence into IAs, leaving other areas of the cell
nonfluorescent and dark. Control cotransfection of
NIT-97QΔ with NIT-GFP constructs with no N-terminal
polyglutamine tracts did not result in the formation of IAs
under any conditions tested, indicating that the short
polyQ tract of 13Q-GFP is necessary for participation in IA
formation.

Preliminary studies of 13Q-GFP recruitment were
performed with a molar excess of 97QΔ plasmid. To further
determine the molar ratio of 97Q to 13Q required for
participation of 13Q in IA formation, 13Q-GFP was titrated
with 97QΔ in a cotransfection assay and the percent of IA
containing cells quantified relative to the total number of
visibly GFP positive cells. No IAs were observed with no
cotransfected 97QΔ or at a 0.1 molar ratio of 97QΔ plasmid;
however, at a 0.5 molar ratio of 97QΔ to 13Q-GFP, 10% of
GFP positive cells contained GFP positive IAs. At a 1:1
ratio, nearly 40% of GFP positive cells also displayed IAs.
At a 10:1 ratio of NIT-97QΔ to NIT-13Q-GFP, over 75% of
positive cells displayed IAs 60 hours post-transfection.
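
The quantification used in this titration (IA-containing cells
expressed as a percentage of visibly GFP-positive cells and
tabulated against the 97Q-deletion:13Q-GFP molar ratio) can be
sketched in Python as follows. The function name and the cell
counts are hypothetical placeholders chosen only to show the
calculation; they are not data from the study.

```python
def percent_ia_positive(ia_cells, gfp_positive_cells):
    """Percentage of GFP-positive cells that contain at least one IA."""
    if gfp_positive_cells <= 0:
        raise ValueError("need at least one GFP-positive cell")
    if not 0 <= ia_cells <= gfp_positive_cells:
        raise ValueError("IA-containing cells must be a subset of GFP-positive cells")
    return 100.0 * ia_cells / gfp_positive_cells


# molar ratio of 97Q-deletion plasmid to 13Q-GFP plasmid
#   -> (IA-containing cells, GFP-positive cells counted)
# Hypothetical counts, for illustration only.
counts = {
    0.0: (0, 200),
    0.1: (0, 200),
    0.5: (20, 200),
    1.0: (78, 200),
    10.0: (152, 200),
}

for ratio, (ia, gfp) in sorted(counts.items()):
    print(f"{ratio:>5}:1  {percent_ia_positive(ia, gfp):5.1f}% IA-positive")
```

Normalizing to GFP-positive cells rather than to all cells in the
field keeps differences in transfection efficiency between wells
from distorting the comparison across molar ratios.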

Example 25

Characterization of IAs formed from a combination of long
and short polyQ-reporters



To better characterize the formation of IAs in the
presence of both long and short polyQ tracts, 97QΔ was
modified to contain an in-frame fusion of the LacZ gene
encoding β-galactosidase (97QZ). Preliminary transient
transfection studies of 97QZ revealed that, like 97Q-GFP,
97QZ condensed within transfected cells to form both
immunopositive and histochemically reactive IAs. NIT-97QZ
and NIT-13Q-GFP were cotransfected into cells and, 60 hours
later, immunohistochemistry and histochemical reactions
were performed on the transfected cells to simultaneously
examine both reporter constructs. Although the
β-galactosidase (β-gal) histochemical reaction obscures GFP
fluorescence, through slight modification of standard
staining techniques and limiting the time of exposure to
reagents, a low level of GFP fluorescence could be
preserved and imaged along with the characteristic blue
stain of the β-galactosidase reaction. A population of IAs
was observed that was both GFP and β-galactosidase
positive. Numerous large IAs positive only for β-gal were
also present, presumably from the aggregation of 97QZ only
(or predominantly). Curiously, several IAs that appeared
positive only for GFP were also observed using the
histochemical stain and native GFP fluorescence imaging.

Further exploration of this observation was performed
using confocal microscopy and immunohistochemistry to
identify reporters within IAs. As in the above study,
individual IAs were clearly observed to be positive for
both reporters while other IAs were positive only for β-gal
or for GFP. Taken together, these results clearly indicate
that long polyglutamine tract-reporter fusion constructs
form IAs in vitro, and that the forming aggregates can
either recruit short polyglutamine-reporters into the
aggregate, induce them to aggregate on their own, or both.

Example 26

Construction of cell lines with regulated IA formation
and 97Q-GFP expression

The study of IA formation has thus far relied heavily
on transient transfection analysis and has been hampered by
a lack of stable IA-forming cultured cell populations.
Through the use of retroviral vectors, a variety of stably
cultured cell lines conditionally or chronically expressing
the polyQ-GFP constructs were constructed. Two regulatable
vector systems, an induced expression system based on a
modified ecdysone receptor variant (VbR) and a repressed
expression system based on the tetracycline transactivator
(TTA), were used to create cell lines with inducible IA
formation. These two vectors are best suited to different
ranges of induction in cultured cell types. VbR-based B3
vectors are particularly suited to applications where very
low basal expression levels that can be rapidly stimulated
to moderately high levels are desired. The TTA-based NIT
vector tends not to have the broad-based low level of
"uninduced" expression characteristic of the BORIS vectors
(identified in SEQ ID NO:6); however, because
transactivator expression and transactivation are continuous
in the absence of ligand, very high transgene expression
levels may be achieved through the use of this vector.

97Q-GFP was cloned via a custom shuttle vector into
both retroviral vectors. Transiently produced B3-polyQ-GFP
retroviruses (B3-13Q or B3-97Q) and TTA-based NIT polyQ-GFP
retroviruses (NIT-13Q or NIT-97Q) were used at low MOI to
infect several well-characterized cell lines used
previously in transduction studies of IA formation. COS-7
cells were selected because several studies had examined IA
formation and IA-induced apoptosis in this cell type.
293 cells were utilized for the same reason and also
because they are amenable to efficient transfection and
would have significant utility in the development of
gene-based factors that interact with or interrupt IA
formation. Primary rat fibroblasts were selected because
they are non-transformed and readily enter a prolonged
post-mitotic state following contact inhibition at high
culture densities.

Pilot studies with reporter constructs revealed that
the B3 vector functioned most efficiently in the 293 and
COS cell lines and that NIT functioned at highest
efficiency in the primary fibroblast line. Individual cell
lines infected with the appropriate vector were placed
under G418 selection to remove uninfected cells. For
B3-polyQ infected 293 and COS-7 cells, the resulting
resistant population was pooled and passaged without
ligand. For NIT-polyQ infected primary fibroblasts, cells
were cultured with the tetracycline analog doxycycline
prior to, during, and after infection to prevent chronic
polyQ overexpression from disrupting growth of resistant
colonies. 293 and COS-7 cells were also infected with NIT
constructs and cultured without added doxycycline to
determine the effects of chronic overexpression of 97Q-GFP
on these cell types as described below.

Example 27 

Stable IA-forming 293 cells

G418-resistant B3-97Q 293s displayed little to no
visible GFP fluorescence during culture in the absence of
ligand; however, introduction of 1 µM tebufenozide into the
culture medium resulted in a rapid increase of cytoplasmic
fluorescence within 24 hours post-induction. Nearly 100%
of cells displayed GFP-positive fluorescence within
24-72 hours of induction. After 24 hours, only widely
scattered cells contained small, detectable IAs. By
48 hours, more than 8% of the cell population contained one
or more IAs, increasing to more than 15% of the population
between 48 and 72 hours. Occasional IA-containing cells
were observed floating in the culture medium; however,
there was no evident "wave" of cell death at the 72 hour
time point or later times. Passage of B3-97Q 293s in the
presence of ligand for three passages resulted in only a
small but detectable decrease in the percentage of cells
containing IAs (15% to 11%).

NIT-97Q infection sheds further light on the toxic
potential of long-polyglutamine repeats on 293 cells.
Surprisingly, colonies of 100% IA-positive 293 cells
survived for over two weeks and apparently divided and
grew, in some cases, to colonies of hundreds of cells
(Figure 4D). By the end of 21 days of culture, however,
the majority of IA positive colonies had disappeared,
leaving only scattered IA positive cells and numerous
colonies of GFP positive cells without IAs that eventually
grew to fill the plates. The expression time-course and
level of B3- and NIT-13Q cells paralleled the 97Q cells,
although no IAs were formed.

Example 28 
Stable IA-forming COS-7 cells

G418-resistant B3-97Q COS-7s displayed no visible GFP
fluorescence during culture in the absence of ligand, in a
manner almost identical to the 293 B3-97Q population.
Addition of 1 µM tebufenozide to the culture medium
resulted in a rapid increase of cytoplasmic fluorescence
within 24-48 hours post-induction. IA formation peaked at
4-5% of total cells (at any given moment) 72-96 hours
post-induction. Unlike the B3-97Q 293 cells, a continuous
population of G418-resistant cells did not display even GFP
fluorescence, suggesting that one or more components of the
regulatable system were lost from the COS population. Like
B3-97Q 293 cells, IA-containing COS-7 cells were also
observed floating in the culture medium; however, a
depletion of the expressing population was not observed,
presumably because of the continuous formation of IAs in
other cells.

The effects of chronic 97Q-GFP overexpression in
infected COS-7 cells were examined using the NIT vector.
As with NIT-97Q infected 293s, colonies in which the
majority of cells contained large and numerous IAs were
readily observed. Unlike the overexpressing 293 cells,
however, several selected colonies continued to grow and
express IAs in a continual percentage of the population
even after passage. Three lines, COSN97Q-4, 6 and 7,
continually express IAs after continuous growth for over
four weeks. COSN97Q-7 displays the highest level of
spontaneous IA formation. At any given time, approximately
6% of the total population contains readily observable IAs.
CV-1 cells, a predecessor of the COS-7 cells, were also
infected with NIT-97Q and formed IA-containing colonies in
much the same manner as COS cell populations.

Example 29
Stable IA-forming primary fibroblasts

B3-97Q infected primary fibroblasts (FF12s) became
faintly GFP fluorescent within 72 hours of tebufenozide
treatment but did not achieve a maximum level of 97Q-GFP
expression sufficient to result in IA formation in a
significant population of cells. The NIT vectors were
employed to generate regulated 97Q-GFP expression.
NIT-97Q FF12s were infected and cultured
continuously in the presence of doxycycline (dox) to
prevent IA formation. Twelve individual colonies of
NIT-97Q FF12s were selected in order to screen for
individual cell populations with low dox-inhibited
expression and high 97Q expression in the absence of
ligand. One line, FFN97Q-5, displayed GFP fluorescence
within 90% of the confluent cell population within 5 days
of ligand removal. IAs began forming on day 5 and
continued to increase through day 8. Approximately 15% of
the total cell population displayed IAs 10 days post ligand
removal. IA formation in the confluent FFN97Q-5 population
was distinctive from other cell types examined in that it
was occasionally exclusively nuclear and at other times
predominantly cytoplasmic in an unusual pattern within the
cell body. IAs with a distinctive stellate or tri-partite
appearance were often observed. Large cytoplasmic
aggregates of multiple star-like IAs were also occasionally
seen. Curiously, confluent FFN97Q-5 cells with multiple
cytoplasmic or nuclear IAs appeared visually normal.
Floating dead cells with or without IAs were rarely
observed and in numbers no different from control wells.
FFN97Q-5s cultured in the presence of dox neither expressed
visible 97Q-GFP nor formed IAs.

Example 30
Use of invention for efficient 
production of transgenic animals 

Transgenic animals are generally produced by either
pronuclear injection of DNA or by transfection of embryonic
stem (ES) cells followed by selection and injection of the
stem cell into the inner cell mass of very early embryos.
Pronuclear injection results in approximately 5-10% stable
gene transfer in the production of transgenic mice. The
use of ES cells in producing transgenics is likewise
inefficient in generating mosaics with germ-line
transmission of the transgene. It was proposed in the
mid-1980s to use retroviruses to transfer transgenes with
high efficiency into early embryos or ES cells to
dramatically enhance the odds of producing transgenic
animals. All attempts at this failed, not because the virus
was incapable of stably integrating into the target cell
genome, but because the integrated provirus did not express
any of the genes encoded within the viral transcriptional
cassette.

The present invention is capable of overcoming the
transcriptional block to result in germ-line transgenic
animals with full expression from the integrated transgene.
In addition to expressing the transgene, the level of
transcription may still be regulated by controlling the
supply of ligand to the transgenic animal. The increased
efficiency of producing transgenic animals by retroviral
infection should open up the way to producing mutant
animals of a variety of species previously impractical for
genetic modification because of the potential cost of
producing a large number of non-positive animals by
classical methods.

To produce transgenic mice, the following nucleic acid
constructs are prepared and subsequently injected into
fertilized eggs: CD3-VbR and a ligand inducible β-gal
reporter. Two separate lines of transgenic mice are
generated harboring either a ligand inducible reporter, or
a T-cell specific expression construct of VbR,
respectively. The former are referred to as reporter mice,
the latter are referred to as receptor mice, and double
transgenic mice are referred to as receptor/reporter mice.
Constructs CD3-VbR are injected, while the reporter is
injected alone. Primary genotyping is performed by
Southern blot analysis and the transmission of the
transgene is monitored by dot blot analysis. Receptor mice
are analyzed for VbR expression by Northern blot analysis
of RNA collected from these mice. For Northern blot
analysis, 15 µg of total RNA obtained from the thymus, and
various tissues as a control, is run on a denaturing gel
and blotted onto a nitrocellulose membrane. The blot is
probed with a radiolabeled β-gal-specific probe and exposed
on film for 2 days. In addition, the transgene can be
transferred to the offspring as expected by Mendelian
genetics.

Example 31

Use of retroviral constructs in the invention for
efficient gene transfer to developing embryos

Since the present invention can effectively overcome
the block of viral expression in embryonic cells, invention
constructs are a potent tool in the delivery of transgenes
to somatic tissues of a developing embryo. With many
diseases, considerable damage is done during embryonic
development so that therapies applied after birth are
essentially ineffective to ameliorate the disease
phenotype.

The present invention provides methodology where one
can infect cells of the embryo and can provide therapeutic
factors to the developing fetus either constitutively, or
under the regulation of exogenously produced ligand.

Example 32 
Use of vector constructs in invention 
with inducible high titers 

One obstacle to the use of retroviruses as gene
transfer agents is that titers of retroviruses from
existing producer cell lines are only on the order of 1X10^
or 1X10^. By using a retroviral construct of the invention
having intact enhancers and regulatory elements, expression
of the retrovirus may be induced by greater than ten-fold,
resulting in correspondingly higher titers of infectious
virus.

While the invention has been described in detail with
reference to certain preferred embodiments thereof, it will
be understood that modifications and variations are within
the spirit and scope of that which is described and
claimed.

That which is claimed is:

1. A nucleic acid construct encoding a receptor
peptide comprising:

(i) a ligand binding domain and hinge region of a
non-mammalian member of the nuclear receptor
superfamily,

(ii) an activation domain, and

(iii) a DNA binding domain;

wherein said receptor peptide activates regulatory
element(s) in the absence of an exogenous dimer partner
therefor and in the presence of a ligand for said ligand
binding domain.

2. A nucleic acid construct according to claim 1
wherein said ligand for said ligand binding domain is a
non-steroidal compound not normally present in a
mammalian cell.

3. A nucleic acid construct according to claim 1, 
wherein said ligand binding domain and hinge region are 
derived from a lepidopteran species. 

4. A nucleic acid construct according to claim 1, 
wherein said ligand binding domain and hinge region are 
derived from hombyx mori . 

5 . A nucleic acid construct according to claim 1 
wherein said ligand binding domain has substantial 
sequence identity with amino acid sequence 3 93-508, as 
set forth in SEQ ID NO: 3. 

6 . A nucleic acid construct according to claim 1 
wherein said ligand binding domain has substantial 
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sequence identity with amine acid sequence 393-585; as 
sec forth in SEQ ID NO: 3. 

7. A nucleic acid construct according to' claim 1 
wherein said hinge region has substantial sequence 
identity with amino acid sequence 273-362, as set forth 
in SEQ ID N0:3 . 

8. A nucleic acid construct according to claim 1
wherein said hinge region has substantial sequence
identity with amino acid sequence 273-362, as set forth
in SEQ ID NO:3.

9. A nucleic acid construct according to claim 1
wherein said DNA binding domain is obtained from a DNA- 
binding protein. 

10. A nucleic acid construct according to claim 9 
wherein said DNA binding domain is obtained from a member 
of the nuclear receptor superfamily. 

11. A nucleic acid construct according to claim 10 
wherein said DNA binding domain is non-mammalian.

12. A nucleic acid construct according to claim 1 
wherein said receptor peptide has substantial sequence 
identity with the amino acid sequence set forth in SEQ ID 
NO:5.

13. A nucleic acid construct according to claim 1,
wherein said nucleic acid construct further comprises a
regulatory element operatively associated with exogenous
nucleic acid(s).

14. A nucleic acid construct according to claim 13
wherein said regulatory element comprises a hormone
response element.



15. A nucleic acid construct according to claim 14 
wherein said construct comprises a plurality of hormone 
response elements.

16. A nucleic acid construct according to claim 13 
wherein said exogenous nucleic acid(s) is (are) selected 
from wild type and therapeutic nucleic acid(s).

17. A nucleic acid construct according to claim 16 
wherein said wild type nucleic acid(s) is (are) selected
from genes which encode products: 

the substantial absence of which leads to the 
occurrence of a non-normal state in said host; or 

a substantial excess of which leads to the 
occurrence of a non-normal state in said host. 

18. A nucleic acid construct according to claim 17, 
wherein said exogenous nucleic acid is a CAG repeat 
encoding a polyglutamine polypeptide. 

19. A nucleic acid construct according to claim 16 
wherein said therapeutic nucleic acid(s) is (are)
selected from those which encode products: 

which are toxic to the cells in which they are 
expressed; or 

which impart a beneficial property to said host.

20. A nucleic acid construct according to claim 19 
wherein said therapeutic nucleic acid(s) modulates the 
transcription and/or translation of endogenous nucleic
acid(s).

21. A nucleic acid construct according to claim 1,
further comprising nucleic acid encoding a selectable 
marker. 

22. A nucleic acid construct according to claim 21 
wherein said selectable marker is a radiolabeled 
molecule, a fluorescent molecule, an enzyme, or a ligand. 

23. A nucleic acid construct according to claim 22
wherein said enzyme is an antibiotic resistance gene or a 
gene which enables cells to process a metabolic 
intermediary. 

24. An expression cassette comprising a promoter
operatively associated with a nucleic acid construct 
according to claim 1. 

25. An expression cassette according to claim 24, 
wherein said promoter is selected from inducible, 
constitutive and/or tissue specific promoters. 

26. A viral vector capable of ligand-mediated 
expression, wherein said vector comprises an expression 
cassette according to claim 24. 

27. A viral vector according to claim 26, wherein 
said viral vector further comprises a retroviral psi (Ψ)
packaging signal and a 5' and/or a 3' long terminal
repeat (LTR).

28. An animal cell in culture which is transformed
with a nucleic acid construct according to claim 1.



29. A cell according to claim 28, wherein said cell 
is further transformed with a reporter vector comprising 
a regulatory element operatively associated with 
exogenous nucleic acid(s).



30. An animal cell in culture which is transformed 
with a nucleic acid construct according to claim 13.

31. An animal cell according to claim 30, wherein 
said cell is an embryonic stem cell. 

32. A transgenic animal, wherein cells of said
animal are transfected with a nucleic acid construct
according to claim 1.



33. A transgenic animal, wherein cells of said
animal are transfected with a nucleic acid construct
according to claim 13.



34. A polypeptide receptor encoded by a nucleic
acid construct according to claim 1. 

35. A nucleic acid construct encoding a receptor 
peptide comprising:

(i) a ligand binding domain and hinge region of a 
non-mammalian member of the nuclear receptor 
superfamily, and

(ii) a DNA binding domain, wherein said DNA binding 
domain is not derived from said non-mammalian 
member; 

wherein said receptor peptide activates regulatory 
element(s) in the absence of an exogenous dimer partner
therefor and in the presence of a ligand for said ligand 
binding domain. 

36. A nucleic acid construct according to claim 35, 
wherein said receptor peptide further comprises: 

(iii) an activation domain. 

37. A nucleic acid construct according to claim 35, 
wherein said nucleic acid construct further comprises a 
regulatory element operatively associated with exogenous
nucleic acid(s).

38. An expression cassette comprising a promoter 
operatively associated with a nucleic acid construct 
according to claim 35. 

39. A viral vector capable of ligand-mediated 
expression, wherein said vector comprises an expression 
cassette according to claim 38. 

40. An animal cell in culture which is transformed 
with a nucleic acid construct according to claim 35. 

41. A cell according to claim 40, wherein said cell 
is further transformed with a regulatory element 
operatively associated with exogenous nucleic acid(s). 

42. A polypeptide receptor comprising: 
a VP16 activation domain, 

a DNA binding domain, and 

Bombyx mori-derived ligand binding domain and hinge
region. 

43. A nucleic acid construct comprising a
regulatory element derived from a Bombyx receptor
response element operatively associated with exogenous
nucleic acid(s).

44. An animal cell in culture which is transformed
with a nucleic acid construct according to claim 43. 

45. An animal cell according to claim 44, wherein a 
polypeptide receptor is further introduced into said 
cell, said receptor comprising: 

an activation domain, 
a DNA binding domain, 

and the ligand binding domain and hinge region of a 
non-mammalian member of the nuclear receptor 
superfamily,

wherein said receptor peptide activates regulatory 
element in the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for said ligand 
binding domain. 

46. A transgenic nonhuman mammal, wherein cells of 
said mammal are transfected with the nucleic acid 
construct according to claim 43. 

47. A method for modulating the transcription of 
exogenous nucleic acid(s) in a host containing: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the
control of a regulatory element; and 

(ii) a receptor peptide comprising a DNA binding 
domain, and the ligand binding domain and hinge 
region of a non-mammalian member of the nuclear
receptor superfamily which is not normally
present in the cells of said host, wherein said 
receptor peptide activates said regulatory 
element in the absence of an exogenous dimer 
partner therefor and in the presence of a 
ligand for said ligand binding domain, 

said method comprising administering to said host an 
amount of ligand effective to modulate the transcription 
of said exogenous nucleic acid(s); wherein ligand is not
normally present in the cells of said host.

48. A method according to claim 47 wherein said 
receptor peptide is provided by a nucleic acid construct 
encoding said receptor peptide. 

49. A method according to claim 48 wherein said 
receptor peptide is expressed under the control of a 
tissue specific promoter. 

50. A method according to claim 47 wherein said 
exogenous nucleic acid(s) is (are) selected from wild 
type and therapeutic nucleic acid(s).

51. A method according to claim 50 wherein said 
wild type nucleic acid(s) is (are) selected from genes 
which encode products: 

the substantial absence of which leads to the 
occurrence of a non-normal state in said host; or 

a substantial excess of which leads to the 
occurrence of a non-normal state in said host. 

52. A method according to claim 50 wherein said
therapeutic nucleic acid(s) is (are) selected from those 
which encode products: 

which are toxic to the cells in which they are 
expressed; or 

which impart a beneficial property to said host.

53. A method according to claim 50 wherein said 
therapeutic nucleic acid(s) modulates the transcription 
and/or translation of an endogenous nucleic acid(s). 

54. A method according to claim 47 wherein said 
regulatory element is not normally present in the cells 
of said host. 

55. A method according to claim 47 wherein said DNA 
binding domain is obtained from a DNA-binding protein. 

56. A method according to claim 55 wherein said DNA
binding domain is obtained from a member of the nuclear 
receptor superfamily. 

57. A method according to claim 55 wherein said DNA 
binding domain is a GAL4 DNA binding domain. 

58. A method according to claim 47 wherein said 
receptor peptide further comprises an activation domain. 

59. A method according to claim 47 wherein said 
host further contains a non-mammalian transactivator not
normally present in the cells of said host and a 
transactivator responsive regulatory element not normally 
present in cells of said host,
wherein said transactivator responsive
regulatory element controls transcription of said
exogenous nucleic acid(s) or a second nucleic acid
construct comprising a second exogenous nucleic
acid(s).

60. A method according to claim 59 wherein said 
exogenous nucleic acid(s) is (are) under the further 
control of said transactivator responsive regulatory 
element.

61. A method according to claim 59 wherein said 
transactivator responsive regulatory element controls 
transcription of a second DNA construct comprising a 
second exogenous nucleic acid(s). 

62. A method according to claim 59 wherein said 
transactivator responsive regulatory element comprises an 
operator which is responsive to said non-mammalian 
transactivator which confers responsiveness to 
antibiotics.

63. A method according to claim 59 wherein said 
transactivator is a ligand-mediated receptor which 
confers responsiveness to antibiotics. 

64. A method according to claim 47 wherein said 
hinge region has substantial sequence identity with amino 
acid sequence 273-362, as set forth in SEQ ID NO: 3. 

65. A method according to claim 47 wherein said 
ligand binding domain has substantial sequence identity 
with amino acid sequence 393-508, as set forth in SEQ ID
NO:3.

66. A method according to claim 47 wherein said
ligand binding domain has substantial sequence identity
with amino acid sequence 393-586, as set forth in SEQ ID
NO:3.

67. A method according to claim 47 wherein said 
receptor peptide has substantial sequence identity with 
the sequence set forth in SEQ ID NO: 3. 

68. A method according to claim 47 wherein said 
receptor peptide has the sequence set forth in SEQ ID 
NO:3.

69. A method according to claim 47 wherein said 
ligand for said ligand binding domain is a non-steroidal
compound.


70. A method according to claim 69 wherein said 
non-steroidal compound is a hydrazine. 

71. A method according to claim 70 wherein said 
hydrazine is a 1,2-diacyl hydrazine, an N'-substituted-
N,N'-disubstituted hydrazine, a dibenzoylalkyl
cyanohydrazine, an N-substituted-N-alkyl-N,N-diaroyl
hydrazine, an N-substituted-N-acyl-N-alkyl, carbonyl
hydrazine, and an N-aroyl-N'-alkyl-N'-aroyl hydrazine.

72. A method according to claim 71 wherein said
hydrazine is a diacyl hydrazine. 

73. A method of inducing the transcription of an
exogenous nucleic acid(s) in a host containing: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the
control of a regulatory element;

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein expression of 
said receptor peptide is under the control of 
an inducible promoter, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 
said ligand binding domain, and 

(iii) said ligand for said ligand binding 
domain, wherein said ligand is not 
normally present in the cells of said 
host;



said method comprising subjecting said host to 
conditions suitable to induce expression of said receptor 
peptide.

74. A method of inducing the transcription of an 
exogenous gene in a host containing a nucleic acid 
construct comprising a promoter and said exogenous 
nucleic acid(s) under the control of a regulatory 
element; said method comprising introducing into a host: 



a receptor peptide comprising a DNA binding 
domain, and the ligand binding domain and hinge 
region of a non-mammalian member of the nuclear 
receptor superfamily which is not normally present
in the cells of said host, wherein said receptor 
peptide activates said regulatory element in the 
absence of an exogenous dimer partner therefor and 
in the presence of a ligand for said ligand binding 
domain, and 

a ligand for said ligand binding domain. 

75. A method for the treatment of a host in need of
gene therapy, said method comprising: 

introducing into cells of said host:

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the 
control of a regulatory element;

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in
the cells of said host, wherein expression of 
said receptor peptide is under the control of 
an inducible promoter, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 
said ligand binding domain; and 

administering, to said host, ligand for said ligand 
binding domain. 

76. A method for the treatment of a host in need of 
gene therapy, said method comprising: 
introducing into cells obtained from said host:

(i) a nucleic acid construct comprising a promoter
and exogenous nucleic acid(s) under the control
of a regulatory element; and

(ii) nucleic acid encoding a receptor peptide
comprising a DNA binding domain, and the ligand
binding domain and hinge region of a non-
mammalian member of the nuclear receptor
superfamily which is not normally present in
the cells of said host, wherein said receptor
peptide activates said regulatory element in
the absence of an exogenous dimer partner
therefor and in the presence of a ligand for
said ligand binding domain,



to provide modified cells; 

reintroducing the modified cells into said host, and 
administering, to said host, ligand for said ligand 
binding domain. 

77. A method for producing transgenic animals 
capable of prolonged and regulated transcription of 
exogenous nucleic acid(s), said method comprising 
introducing into early-stage embryos or stem cells: 

(i) a nucleic acid construct comprising a promoter 
and said exogenous nucleic acid(s) under the
control of a regulatory element; 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non-
mammalian member of the nuclear receptor
superfamily which is not normally present in
the cells of said transgenic animal, wherein 
said receptor peptide activates said regulatory 
element in the absence of an exogenous dimer 
partner therefor and in the presence of a 
ligand for said ligand binding domain. 

78. A method for the expression of recombinant 
products detrimental to a host organism, said method 
comprising:

transforming suitable host cells with: 

(i) a nucleic acid construct comprising a promoter 
and exogenous nucleic acid(s) which express 
said recombinant product under the control of a 
regulatory element; wherein said regulatory 
element is not normally present in the cells of 
said host, and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein said receptor 
peptide activates said regulatory element in 
the absence of an exogenous dimer partner for 
said ligand binding domain, and 

growing said host cells to the desired level in the 
substantial absence of ligand for said receptor peptide; 
and 

inducing expression of said recombinant product by 
introducing into said host cells said ligand, which, in
combination with said receptor peptide, binds to said 
regulatory element and activates transcription therefrom. 

79. A method according to claim 78, wherein said
exogenous nucleic acid is a CAG repeat which encodes a 
polyglutamine polypeptide. 

80. A method for identifying compounds useful for
treating neurodegenerative conditions, said method
comprising:

introducing a test compound into a host cell
transformed according to claim 79, and

identifying those test compounds that target
intracellular aggregates of polyglutamines as compounds
useful for treating neurodegenerative conditions.

81. A method for modulating the transcription of 
nucleic acid(s) in an in vitro system, said method 
comprising administering to said system an amount of said 
ligand effective to modulate the transcription of said 
nucleic acid(s); wherein said ligand is not normally 
present in said cellular system; wherein said system 
comprises:

(i) a nucleic acid construct comprising a promoter 
and said nucleic acid(s) under the control of a 
regulatory element; and 

(ii) nucleic acid encoding a receptor peptide 
comprising a DNA binding domain, and the ligand 
binding domain and hinge region of a non- 
mammalian member of the nuclear receptor 
superfamily which is not normally present in 
the cells of said host, wherein said receptor
peptide activates said regulatory element in 
the absence of an exogenous dimer partner 
therefor and in the presence of a ligand for 
said ligand binding domain. 

82. A nucleic acid construct comprising nucleic 
acid encoding a VP16 activation domain, a DNA binding 
domain and Bombyx mori-derived ligand binding domain and
hinge region. 

83. A nucleic acid construct comprising a promoter
and nucleic acid encoding a tetracycline-controlled
transactivator, a VP16 activation domain, a DNA binding
domain and Bombyx mori-derived ligand binding domain and
hinge region, in operative association with one another.

84. A vector comprising: 

a first reporter under the control of a regulatory 
element, 

a second reporter under the control of an operator 
which is responsive to a ligand-mediated receptor which 
confers responsiveness to antibiotics, and 

a nucleic acid construct comprising a promoter 
operatively associated with nucleic acid encoding a 
tetracycline-controlled transactivator, a VP16 activation 
domain, a DNA binding domain and Bombyx mori-derived
ligand binding domain. 
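
Several of the claims above delimit the hinge region and the ligand binding domain by residue positions in SEQ ID NO:3 (273-362, 393-508, 393-585 and 393-586). As a minimal sketch, and not as part of the claimed subject matter, the following Python shows how such 1-based, inclusive ranges map onto a sequence string. The function name `residue_range` and the placeholder sequence are hypothetical; the placeholder only has the length stated in the sequence listing (606 residues) and contains no real residues.

```python
def residue_range(sequence: str, first: int, last: int) -> str:
    """Return residues first..last (1-based, inclusive) of a sequence.

    Patent claims number residues from 1 and include both end points,
    whereas Python slices are 0-based and exclude the end index.
    """
    if not (1 <= first <= last <= len(sequence)):
        raise ValueError("range lies outside the sequence")
    return sequence[first - 1:last]


# Hypothetical placeholder with the length given for SEQ ID NO:3 (606).
# It stands in for the real sequence and carries no biological meaning.
placeholder = "X" * 606

hinge = residue_range(placeholder, 273, 362)
lbd_short = residue_range(placeholder, 393, 508)
lbd_long = residue_range(placeholder, 393, 586)

print(len(hinge), len(lbd_short), len(lbd_long))  # 90 116 194
```

With a real sequence in place of the placeholder, the same call would return the residues that a sequence identity comparison would be run against.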






SUBSTITUTE SHEET (RULE 26) 



WO 99/02683



PCT/US98/14215 



[Sheet 1/24]

[Figure 1A: receptor schematics. Recoverable labels: bR, VbR, VP16, DBD, LBD, Asc I, Mlu I.]

[Figure 1B: replacement constructs. Recoverable labels: REPLACEMENT, BstBI-CGGAGCTGCCATGTGAA-Bcl, BstBI-CGACCCTCACCATCGCC-Bcl, KO, VbRRA, VbRTR, VP16, DBD, LBD.]

[Figure 1C: Recoverable labels: TTMT, TTA, VP16, DBD, LBD, Mlu I, Tet-responsive region, Bombyx mori-derived responsive region.]



[Sheet 2/24]

[Figures 2A and 2B: bar charts with legend "- Lig" (open bars) and "+ Lig" (filled bars). Recoverable axis ticks: 100 to 600 on one panel and 10 to 10000 on the other.]



[Sheet 3/24]

[Figures 2C and 2D: bar charts with legends "-Lig"/"+Teb" and "-Lig"/"+Dox". Recoverable axis ticks: 25000 to 150000, and 10000 to 20000.]



[Sheet 4/24]

[Figure 2E: bar chart. Recoverable labels: 6E TetO - Luc, 4EcrE - Bgal.]



[Sheet 5/24]

[Figure 3: bar chart. Recoverable label: VBE-GH clone 3.]



[Sheets 6/24 through 20/24: figure content not recoverable from the scanned text.]



[Sheet 21/24]

[Figure 14: B3-2V retrovirus organization, shown without ligand (- Ligand) and with ligand (+ Ligand). Recoverable labels: LTR, Neo, 6e, Transgene, CMV, VBE, LTR; legend symbols for VBE, RXR and transgene product.]



[Sheet 22/24]

[Figure 15A: Poly-Q expression constructs (polyQ-GFP). Recoverable annotations: poly-Q repeat region from human huntingtin; 13/97 Qs; GFP; vector labeled "pSPSK - GFP" (also printed as "SPRK - GFP"); "Digest PCR product EcoRI/NcoI and insert into SPRK - GFP in-frame with the GFP ORF".]

[Sheet 23/24: figure content not recoverable from the scanned text.]



[Sheet 24/24]

[Figure 16: B3-V polyQ-GFP constructs to date. Recoverable element labels: LTR, Neo, 6e, Transgene, CMV, VBE, Zeo, LTR; repeat lengths 13 and 97. Constructs listed: B3NV, B3ZV, B3ZV-13Q-GFP, B3NV-97Q-GFP, B3NV-13QN-GFP, B3NV-97QN-GFP, B3NV-13NQ-GFP, B3NV-97NQ-GFP, B3ZV-97Q-GFP, B3ZV-97QN-GFP, B3NV-GFP.]






SEQUENCE LISTING 
<110> The Salk Institute for Biological Studies 

<120> Modified Lepidopteran Receptors, and
Hybrid Multi-Functional Proteins for Use in Transcription
and Transgene Expression Regulation

<130> SALK2070WO 

<140> PCT/US98/14215
<141> 1998-07-10 

<150> 60/091874 
<151> 1998-07-07 

<150> 08/891,298 
<151> 1997-07-10 

<160> 8 

<170> FastSEQ for Windows Version 3.0 

<210> 1
<211> 71
<212> PRT
<213> Artificial Sequence
<220>
<223> DNA binding domain of member of steroid/thyroid
hormone receptor superfamily

<400> 1 

Cys Xaa Xaa Cys Xaa Xaa Asp Xaa Ala Xaa Gly Xaa Tyr Xaa Xaa Xaa 

1 5 10 15 

Xaa Cys Xaa Xaa Cys Lys Xaa Phe Phe Xaa Arg Xaa Xaa Xaa Xaa Xaa

20 25 30 

Xaa Xaa Xaa Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys 

35 40 45 

Xaa Xaa Xaa Lys Xaa Xaa Arg Xaa Xaa Cys Xaa Xaa Cys Arg Xaa Xaa 

50 55 60 

Lys Cys Xaa Xaa Xaa Gly Met 
65 70 

<210> 2 
<211> 2711 

<212> DNA 

<213> Bombyx mori 

<220> 
<221> CDS 

<222> (185)...(2002)
<400> 2 

ctccgttgac gacggtcgcg cgtgcggtac gtccgttttt acggctcaaa cgtacacggt 60 
aacctccgtc tctgcatcat cggcggaact cgtgaaattc gcgtgctttt ctcacctgtt 120 
gaacgagttg tgttgtgact gaaaaacatc atcacaaata tcaagcttca aaactaataa 180
gtga atg aga gtc gag aac gtg gat aac gta tcg ttt gct ttg aac gga 229
Met Arg Val Glu Asn Val Asp Asn Val Ser Phe Ala Leu Asn Gly
1 5 10 15

cgc gct gac gag tgg tgt atg tct gta gag acg cgt tta gat agt tta 277
Arg Ala Asp Glu Trp Cys Met Ser Val Glu Thr Arg Leu Asp Ser Leu
20 25 30

gtg cga gaa aaa agt gaa gtg aaa gcc tac gtc gga gga tgt ccc tcg 325
Val Arg Glu Lys Ser Glu Val Lys Ala Tyr Val Gly Gly Cys Pro Ser
35 40 45

gta atc acg gat gct gga gcg tat gac gcg ctc ttc gac atg aga cgc 373
Val Ile Thr Asp Ala Gly Ala Tyr Asp Ala Leu Phe Asp Met Arg Arg
50 55 60

cgc tgg tct aat aac ggt ggc ttc ccg ctg cga atg ctt gaa gag agc 421
Arg Trp Ser Asn Asn Gly Gly Phe Pro Leu Arg Met Leu Glu Glu Ser
65 70 75

tct tca gaa gtg aca tcg tct tcg gca ctg ggt ttg cca ccg gcc atg 469
Ser Ser Glu Val Thr Ser Ser Ser Ala Leu Gly Leu Pro Pro Ala Met
80 85 90 95

gtt atg tcg ccg gaa tcc ttg gcg tcg ccc gag tat cga gcc ctc gag 517
Val Met Ser Pro Glu Ser Leu Ala Ser Pro Glu Tyr Arg Ala Leu Glu
100 105 110

cta tgg agc tac gat gac gga atc act tat aat aca gcc cag tct ctg 565
Leu Trp Ser Tyr Asp Asp Gly Ile Thr Tyr Asn Thr Ala Gln Ser Leu
115 120 125

ggt gca tgc aat atg caa cag caa cag cta caa cct cag caa cca 613
Leu Gly Ala Cys Asn Met Gln Gln Gln Gln Leu Gln Pro Gln Gln Pro
130 135 140

cat cca gca cca ccg acg ctc ccc acg atg cct tta cca atg cct ccc 661
His Pro Ala Pro Pro Thr Leu Pro Thr Met Pro Leu Pro Met Pro Pro
145 150 155

aca aca ccg aaa tca gaa aat gaa tcg atg tca tca ggt cga gag gaa 709
Thr Thr Pro Lys Ser Glu Asn Glu Ser Met Ser Ser Gly Arg Glu Glu
160 165 170 175

ctt tcg ccg gct tca agc ata aat ggc tgc agt gct gat gct gac gcc 757
Leu Ser Pro Ala Ser Ser Ile Asn Gly Cys Ser Ala Asp Ala Asp Ala
180 185 190

aga cgg cag aag aaa ggt cct gca cct cga cag caa gag gag cta tgt 805
Arg Arg Gln Lys Lys Gly Pro Ala Pro Arg Gln Gln Glu Glu Leu Cys
195 200 205

ctt gtc tgc ggc gac aga gcc tcc gga tac cac tac aac gca ctg acg 853
Leu Val Cys Gly Asp Arg Ala Ser Gly Tyr His Tyr Asn Ala Leu Thr
210 215 220

tgt gaa gga tgc aaa gga ttc ttq agg cgg agt gtc acc aaa aac gca 901
Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Ser Val Thr Lys Asn Ala
225 230 235

gta tat att tgt aaa ttt gga cat gcc tgt gaa atg gat atg tac atg 949
Val Tyr Ile Cys Lys Phe Gly His Ala Cys Glu Met Asp Met Tyr Met
240 245 250 255
agg agg aaa tgr caa gag cgt cga tta aag aaa tgz cta gcg oca gga 997
Arg Arg Lys Cys Gln Glu Cys Arg Leu Lys Lys Cys Leu Ala Val Gly
260 265 270

atg agg ccz gaa tgt gcc ata cag gag ccc agt aaa aat aaa gac agg 1045
Met Arg Pro Glu Cys Val Ile Gln Glu Pro Ser Lys Asn Lys Asp Arg
275 280 285

caa aga caa aag aaa gac aaa gga ata tta tta cct gtt agt acg acc 1093
Gln Arg Gln Lys Lys Asp Lys Gly Ile Leu Leu Pro Val Ser Thr Thr
290 295 300

aca gtc gaa gac cac atg ccc ccg atc atg caa tgt gat cca cct ccg 1141
Thr Val Glu Asp His Met Pro Pro Ile Met Gln Cys Asp Pro Pro Pro
305 310 315

ccc gag gcc gcc agg att cac gaa gtc gtc ccg agg tat ctt tcg gag 1189
Pro Glu Ala Ala Arg Ile His Glu Val Val Pro Arg Tyr Leu Ser Glu
320 325 330 335

aag ctg atg gag cag aac agg cag aag aac ata cca cca ttg tcg gcg 1237
Lys Leu Met Glu Gln Asn Arg Gln Lys Asn Ile Pro Pro Leu Ser Ala
340 345 350

aac cag aag tct ctg atc gcg agg ctc gtg tgg tac cag gag gga tat 1285
Asn Gln Lys Ser Leu Ile Ala Arg Leu Val Trp Tyr Gln Glu Gly Tyr
355 360 365

gag cag ccc tcc gac gag gat ctc aaa aga gta acg cag act tgg cag 1333
Glu Gln Pro Ser Asp Glu Asp Leu Lys Arg Val Thr Gln Thr Trp Gln
370 375 380

tcg gat gaa gag gac gag gaa tcc gat cta ccc ttc cgc cag atc acg 1381
Ser Asp Glu Glu Asp Glu Glu Ser Asp Leu Pro Phe Arg Gln Ile Thr
385 390 395

gag atg acg atc tta acg gtc cag ttg atc gtc gag ttc gcc aag ggt 1429
Glu Met Thr Ile Leu Thr Val Gln Leu Ile Val Glu Phe Ala Lys Gly
400 405 410 415

cta ccg ggc ttt tcg aag ata tca cag tct gat caa atc acc tta tta 1477
Leu Pro Gly Phe Ser Lys Ile Ser Gln Ser Asp Gln Ile Thr Leu Leu
420 425 430

aaa gcc tcg tcc agc gag gtg atg atg ctg cgg gtg gcg agg cga tac 1525
Lys Ala Ser Ser Ser Glu Val Met Met Leu Arg Val Ala Arg Arg Tyr
435 440 445

gac gcc gcg tcc gac agc gtg ctg ttc gcc aac aac aag gcg tac acg 1573
Asp Ala Ala Ser Asp Ser Val Leu Phe Ala Asn Asn Lys Ala Tyr Thr
450 455 460

cgc gac aac tac cgc caa ggc ggc atg gcc tac gtc atc gaa gac ctc 1621
Arg Asp Asn Tyr Arg Gln Gly Gly Met Ala Tyr Val Ile Glu Asp Leu
465 470 475

cta cac ttc tgc cgg tgc atg ttc gcg atg ggc atg gac aat gtg cac 1669
Leu His Phe Cys Arg Cys Met Phe Ala Met Gly Met Asp Asn Val His
480 485 490 495

ttt gca ctg ctc acg gcc atc gtt ata ttc tca gat cgg ccc ggg ctc 1717
Phe Ala Leu Leu Thr Ala Ile Val Ile Phe Ser Asp Arg Pro Gly Leu
500 505 510

gag cag ccg tcg ctg gta gaa gag atc cag aga tac tac ctg aac acg 1765
Glu Gln Pro Ser Leu Val Glu Glu Ile Gln Arg Tyr Tyr Leu Asn Thr
515 520 525

ttg cga att tac atc atc aac cag aac agc gcg tcg tcg cgc tgc gcc 1813
Leu Arg Ile Tyr Ile Ile Asn Gln Asn Ser Ala Ser Ser Arg Cys Ala
530 535 540

gtg atc tac ggc agg atc ctg agc gtg ctg acc gag cta cgc acg ctc 1861
Val Ile Tyr Gly Arg Ile Leu Ser Val Leu Thr Glu Leu Arg Thr Leu
545 550 555

ggc acg caa aac tcc aac atg tgc atc tcg ctg aag ctg aag aac agg 1909
Gly Thr Gln Asn Ser Asn Met Cys Ile Ser Leu Lys Leu Lys Asn Arg
560 565 570 575

aag ctg ccg ccg ttc ctc gag gag atc tgg gac gtg gcg gag gtg gcc 1957
Lys Leu Pro Pro Phe Leu Glu Glu Ile Trp Asp Val Ala Glu Val Ala
580 585 590

acg acg cat ccc acg gtg ctg ccg ccc acc aac ccg gtg gtg cta 2002
Thr Thr His Pro Thr Val Leu Pro Pro Thr Asn Pro Val Val Leu
595 600 605

tagcctccgc ccgccccagg agagaacgct catagactgg ctagttttag tgaacgtgcg 2062 

ctgatccgta ttcggtgaca gattagtgat tatatgtgtt gttgaacgtt tggagagtat 2122 

atatatagtg ttgacggcga ggcccgtccg gccccgtact tgtttcgttt ctgaccggat 2182 

gctgcgtcgg tcgcgccctt gcgaccacga taagactact ttctataagt acgtctctaa 2242 

attgaggccc caagaaccgg aagcctaact aagatacgac ggtttatttt atcacagagg 2302 

aaactgaagt aattaatata ttaacagtta acacaggttc aaagcagtta ggccgctccg 2362 

gacttctcat ggtctcaata aggcggacgt ataagagtta catagagaat aaaataataa 2422 

tatatgaaga gatgtttcta ttggatggaa tgcgtgatgt aaagttgaca gttattttta 2482 

tttaccaaat taatgaagcg tcgggtgtag accttttgta tgtgatgtgg cgaggagtgg 2542 

atcgcagtgt cggccgcgtg ctctcaccaa aagcgtgcgg tcgacgctaa tagtgcgatg 2602 

gttttggaat atgtttgttt atatatagtt tatgtgtgag gtgttatcgt gtcccgtcaa 2662 

ttttaacttc gattcgcgtt cattcgtcct gtgctcgcta ctcagattt 2711 

<210> 3 

<211> 606 

<212> PRT 

<213> Bombyx mori 

<400> 3 

Met Arg Val Glu Asn Val Asp Asn Val Ser Phe Ala Leu Asn Gly Arg 

1 5 10 15 

Ala Asp Glu Trp Cys Met Ser Val Glu Thr Arg Leu Asp Ser Leu Val 

20 25 30 

Arg Glu Lys Ser Glu Val Lys Ala Tyr Val Gly Gly Cys Pro Ser Val 

35 40 45 

Ile Thr Asp Ala Gly Ala Tyr Asp Ala Leu Phe Asp Met Arg Arg Arg 

50 55 60 

Trp Ser Asn Asn Gly Gly Phe Pro Leu Arg Met Leu Glu Glu Ser Ser 
65 70 75 80 

Ser Glu Val Thr Ser Ser Ser Ala Leu Gly Leu Pro Pro Ala Met Val 

85 90 95 

Met Ser Pro Glu Ser Leu Ala Ser Pro Glu Tyr Arg Ala Leu Glu Leu 

100 105 110 

Trp Ser Tyr Asp Asp Gly Ile Thr Tyr Asn Thr Ala Gln Ser Leu Leu 

115 120 125 

Gly Ala Cys Asn Met Gln Gln Gln Gln Leu Gln Pro Gln Gln Pro His 
130 135 140 

Pro Ala Pro Pro Thr Leu Pro Thr Met Pro Leu Pro Met Pro Pro Thr 
145 150 155 160 

Thr Pro Lys Ser Glu Asn Glu Ser Met Ser Ser Gly Arg Glu Glu Leu 
165 170 175 

Ser Pro Ala Ser Ser Ile Asn Gly Cys Ser Ala Asp Ala Asp Ala Arg 
180 185 190 

Arg Gln Lys Lys Gly Pro Ala Pro Arg Gln Gln Glu Glu Leu Cys Leu 
195 200 205 

Val Cys Gly Asp Arg Ala Ser Gly Tyr His Tyr Asn Ala Leu Thr Cys 
210 215 220 

Glu Gly Cys Lys Gly Phe Phe Arg Arg Ser Val Thr Lys Asn Ala Val 
225 230 235 240 

Tyr Ile Cys Lys Phe Gly His Ala Cys Glu Met Asp Met Tyr Met Arg 
245 250 255 

Arg Lys Cys Gln Glu Cys Arg Leu Lys Lys Cys Leu Ala Val Gly Met 
260 265 270 

Arg Pro Glu Cys Val Ile Gln Glu Pro Ser Lys Asn Lys Asp Arg Gln 
275 280 285 

Arg Gln Lys Lys Asp Lys Gly Ile Leu Leu Pro Val Ser Thr Thr Thr 
290 295 300 

Val Glu Asp His Met Pro Pro Ile Met Gln Cys Asp Pro Pro Pro Pro 
305 310 315 320 

Glu Ala Ala Arg Ile His Glu Val Val Pro Arg Tyr Leu Ser Glu Lys 
325 330 335 

Leu Met Glu Gln Asn Arg Gln Lys Asn Ile Pro Pro Leu Ser Ala Asn 
340 345 350 

Gln Lys Ser Leu Ile Ala Arg Leu Val Trp Tyr Gln Glu Gly Tyr Glu 
355 360 365 

Gln Pro Ser Asp Glu Asp Leu Lys Arg Val Thr Gln Thr Trp Gln Ser 
370 375 380 

Asp Glu Glu Asp Glu Glu Ser Asp Leu Pro Phe Arg Gln Ile Thr Glu 
385 390 395 400 

Met Thr Ile Leu Thr Val Gln Leu Ile Val Glu Phe Ala Lys Gly Leu 
405 410 415 

Pro Gly Phe Ser Lys Ile Ser Gln Ser Asp Gln Ile Thr Leu Leu Lys 
420 425 430 

Ala Ser Ser Ser Glu Val Met Met Leu Arg Val Ala Arg Arg Tyr Asp 
435 440 445 

Ala Ala Ser Asp Ser Val Leu Phe Ala Asn Asn Lys Ala Tyr Thr Arg 
450 455 460 

Asp Asn Tyr Arg Gln Gly Gly Met Ala Tyr Val Ile Glu Asp Leu Leu 
465 470 475 480 

His Phe Cys Arg Cys Met Phe Ala Met Gly Met Asp Asn Val His Phe 
485 490 495 

Ala Leu Leu Thr Ala Ile Val Ile Phe Ser Asp Arg Pro Gly Leu Glu 
500 505 510 

Gln Pro Ser Leu Val Glu Glu Ile Gln Arg Tyr Tyr Leu Asn Thr Leu 
515 520 525 

Arg Ile Tyr Ile Ile Asn Gln Asn Ser Ala Ser Ser Arg Cys Ala Val 
530 535 540 

Ile Tyr Gly Arg Ile Leu Ser Val Leu Thr Glu Leu Arg Thr Leu Gly 
545 550 555 560 

Thr Gln Asn Ser Asn Met Cys Ile Ser Leu Lys Leu Lys Asn Arg Lys 
565 570 575 

Leu Pro Pro Phe Leu Glu Glu Ile Trp Asp Val Ala Glu Val Ala Thr 
580 585 590 

Thr His Pro Thr Val Leu Pro Pro Thr Asn Pro Val Val Leu 
595 600 605 

<210> 4 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> GAL4 response element 
<400> 4 

cggaggactg tcctccg 17 

<210> 5 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<400> 5 

gagagaagct tatggcgcgc ccgaccgatg 30 

<210> 6 
<211> 29 

<212> DNA 

<213> Artificial Sequence 
<400> 6 

cacacacgcg tgtactcgtc aattccaag 29 

<210> 7 
<211> 1977 
<212> DNA 

<213> Artificial Sequence 
<400> 7 

atggcgcgcc cgaccgatgt cagcctgggg gacgagctcc acttagacgg cgaggacgtg 60 

gcgatggcgc atgccgacgc gctagacgat ttcgatctgg acatgttggg ggacggggat 120 

tccccgggtc cgggatttac cccccacgac tccgccccct acggcgctct ggatatggcc 180 

gacttcgagt ttgagcagat gtttaccgat gcccttggaa ttgacgagta cacgcgttta 240 

gatagtttag tgcgagaaaa aagtgaagtg aaagcctacg tcggaggatg tccctcggta 300 

atcacggatg ccggagcgta tgacgcgctc ttcgacatga gacgccgctg gtctaataac 360 

ggtggcttcc cgctgcgaat gcttgaagag agctcttcag aagtgacatc gtcttcggca 420 

ctgggtttgc caccggccat ggttatgtcg ccggaatcct tggcgtcgcc cgagtatcga 480 

gccctcgagc tatggagcta cgatgacgga atcacttata atacagccca gtctctgctg 540 

ggtgcatgca atatgcaaca gcaacagcta caacctcagc aaccacatcc agcaccaccg 600 

acgctcccca cgatgccttt accaatgcct cccacaacac cgaaatcaga aaatgaatcg 660 

atgtcatcag gtcgagagga actttcgccg gcttcaagca taaatggctg cagtgctgat 720 

gctgacgcca gacggcagaa gaaaggtcct gcacctcgac agcaagagga gctatgtctt 780 

gtctgcggcg acagagcctc cggataccac tacaacgcac tgacgtgtga aggatgcaaa 840 

ggattcttca ggcggagtgt caccaaaaac gcagtatata tttgtaaatt tggacatgcc 900 

tgtgaaatgg atatgtacat gaggaggaaa tgtcaagagt gtcgattaaa gaaatgtcta 960 

gcggtaggaa tgaggcctga atgtgtcata caggagccca gtaaaaataa agacaggcaa 1020 

agacaaaaga aagacaaagg aatattatta cctgttagta cgaccacagt cgaagaccac 1080 

atgcccccga tcatgcaatg tgatccacct ccgcccgagg ccgccaggat tcacgaagtc 1140 

gtcccgaggt atctttcgga gaagctgatg gagcagaaca ggcagaagaa cataccacca 1200 

ttgtcggcga atcagaagtc tctgatcgcg aggctcgtgt ggtaccagga gggatatgag 1260 

cagccctccg acgaggatct caaaagagta acgcagactt ggcagtcgga tgaagaggac 1320 

gaggaatccg atctaccctt ccgccagatc acggagatga cgatcttaac ggtccagttg 1380 

atcgtcgagt tcgccaaggg tctaccgggc ttttcgaaga tatcacagtc tgatcaaatc 1440 

accttattaa aagcctcgtc cagcgaggtg atgatgctgc gggtggcgag gcgatacgac 1500 

gccgcgtccg acagcgtgct gttcgccaac aacaaggcgt acacgcgcga caactaccgc 1560 

caaggcggca tggcctacgt catcgaagac ctcctacact tctgccggtg catgttcgcg 1620 

atgggcatgg acaatgtgca ctttgcactg ctcacggcca tcgttatatt ctcagatcgg 1680 

cccgggctcg agcagccgtc gctggtagaa gagatccaga gatactacct gaacacgttg 1740 

cgaatttaca tcatcaacca gaacagcgcg tcgtcgcgct gcgccgtgat ctacggcagg 1800 

atcctgagcg tgctgaccga gctacgcacg ctcggcacgc aaaactccaa catgtgcatc 1860 

tcgctgaagc tgaagaacag gaagctgccg ccgttcctcg aggagatctg ggacgtggcg 1920 

gaggtggcca cgacgcatcc cacggtgctg ccgcccacca acccggtggt gctatag 1977 

<210> 8 
<211> 9072 
<212> DNA 

<213> Artificial Sequence 

<400> 8 
tatggaggga cacccgcggc acttttcggg gaaatgtgcg cggaacccct acrtgcrtat 60 

ctttcraaat acattcaaac atgnatccgc ccacgagaca ataacccrga taaatgctcc 120 

aacaacattg aaaaaggaag agratgagta tucaacattt ccgtgtcgcc ctcattccct 180 

trtttgcggc attttgccct cccgttcttg ctcacccaga aacgctggtg aaagtaaaag 240 

atgcrgaaga ccagttgggt gcacgagcgg gttacatcga actggatctc aacagcggta 300 

agatcctcga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc 360 

tgcuatgtgg cgcggtacta tcccgtatng acgccgggca agagcaactc ggccgccgca 420 

cacactatcc tcagaatgac tcggttgagt acucaccagt cacagaaaag catcttacgg 480 

atggcatgac agtaagagaa ctatgcagtg ctgccacaac cacgagtgat aacaccgcgg 540 

ccaactcact tccgacaacg accggaggac cgaaggagct aaccgccttt ttgcacaaca 600 

tgggggatca cgcaactcgc ctcgaccgct gggaaccgga gctgaatgaa gccataccaa 660 

acaacgagcg tgacaccacc atgccrgtag caatggcaac aacgccgcgc aaactattaa 720 

crggcgaacc acctacrtcra gcttcccggc aacaattaar agactggatg gaggcggata 780 

aagttgcagg accacttctg cgcccggccc rtccggctgg ctggtttatt gctgataaat 840 

ctggagccgg tgagcgcggg rctcgcggta ccactgcagc actggggcca gatggtaagc 900 

ccccccgtat cgtagccacc tacacgacgg ggagtcaggc aaccacggat gaacgaaata 960 

gacagatcgc tgagataggt gcctcactga utaagcattg gtaaccgcca gaccaagttt 1020 

actcatacac accttagatt gacttaaaac ttcattttta atttaaaagg atctaggtga 1080 

agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagtcttcg ttccactgag 1140 

cgccagaccc cgtagaaaag atcaaaggat cttcttgaga tccttctttt ccgcgcgtaa 1200 

tccgcrgcti gcaaacaaaa aaaccaccgc taccagcggr ggtttgtttg ccggatcaag 1260 

agctaccaac tctttcrccg aaggtaactg gcttcagcag agcgcagata ccaaatactg 1320 

tcctcctagt gtagccgtag ttaggccacc acrtcaagaa ccctgtagca ccgcctacat 1380 

acctcgctct gctaatcctg ctaccagtgg ctgctgccag tggcgataag tcgtgcctta 1440 

ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggccgggc tgaacggggg 1500 

gtrcgtgcac acagcccagc ttggagcgaa cgacccacac cgaactgaga tacctacagc 1560 

gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa 1620 

gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcccggtatc 1680 

ttcatagtcc tgtcgggttt cgccacctct gacttgagcg rcgatttttg tgatgctcgc 1740 

caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttccrggcct 1800 

trrgctggcc cttrgctcac atgttcrttc ctgcgttatc ccctgarcct gtggaraacc 1860 

gtatcaccgc ctttgagtga gcrgataccg ctcgccgcag ccgaacgacc gagcgcagcg 1920 

agtcagtgag cgaggaagcg gaagagcgcc caatacgcaa accgcctctc cccgcgcgtt 1980 

ggccgattca ttaatgcagc tggcacgaca ggttccccga ctggaaagcg ggcagcgagc 2040 

gcaacgcaat taatgcgagt tagctcactc attaggcacc ccaggcrtta cactttatgc 2100 

ttccggctcg tatgctgtgt ggaantgrga gcggataaca atttcacaca ggaaacagct 2160 

atgaccatga ttacgccaag cgcgcaatta accctcacta aagggaacaa aagctggagc 2220 

cagcggccgc aacaaaacaa aagaccttat ttagtcccca gaaaaagggg ggaatgaaag 2280 

accccacctg taggtttggc aagccagctt aagtaacgcc actctgcaag gcatggaaaa 2340 

acacacaact gagaaragag aagrtcagat caaggccagg aacagacgga acagccgaat 2400 

atgggccaaa caggatatct gtggtaagca gttcctgccc cggcncaggg ccaagaacag 2460 

atggaacagc cgaatatggg ccaaacagga catctgtggt aagcagctcc tgccccggct 2520 

cagggccaag aacagacggt ccccagatgc ggtccagccc ccagcagctt ctagaacaag 2580 

ggttcaatgc acttgtcgct agaacaaggg ttcaacgcac ttgccgccag aacaagggtt 2640 

caatgcactt gtcgctagaa caagggttca atgcacttgt cgagaaccat cagatgtttc 2700 

cagggtgccc caaggacctg aaatgacccr gcgccctacc tgaactaacc aatcagttcg 2760 

cctctcgctt cugttcgcgc gcttctgccc cccgagccca ataaaagagc ccacaacccc 2820 

tcacrcgggg cgccagtcct ccgatcgact gagacgcgtc ggggtacccg cattcccaat 2880 

aaagcctctt gctgctgcac ccgaatcgtg gactcgcnga tccttgggag ggtctcctca 2940 

gacrgattga ctgcccacct cgggggtctt tcatcrggag grtccaccga gattggagac 3000 

ccctgcccag ggaccaccga ccccccccgc cgggaggtaa gccggccagc ggccggtttc 3060 

gtgcctgrct ctgtctccgt gcgtgtttgt gccggcatcc aatgtttgcg cctgcgtctg 3120 

tactagttag ctaactagct ctgcatctgg cggacccgcg gtggaactga cgagttctga 3180 

acacccggcc gcaaccctgg gagacgtccc agggactttg ggggccgttt ttgtggcccg 3240 

acctgaggaa gggagtcgat gtggaatccg accccgtcag gatatgtggc tctggtagga 3300 

gacgagaacc taaaacagtt cccgcctccg tctgaatttt tgctttcggt ttggaaccga 3360 

agccgcgcgt cttgtctgct gcagcatcgt tctgtgttgt ctctgtctga ctgtgtttct 3420 

gtatttgtct gaaaattagg gccagactgt taccactccc ttaagtttga ccttaggtca 3480 

ctggaaagat gtcgagcgga tcgctcacaa ccagtcggta gatgtcaaga agagacgttg 3540 

ggttaccttc tgctctgcag aatggccaac ctttaacgtc ggatggccgc gagacggcac 3600 

ctttaaccga gacctcatca cccaggttaa gatcaaggtc ttttcacccg gcccgcatgg 3660 

acacccagac caggtcccct acatcgtgac ctgggaagcc ttggcttttg acccccctcc 3720 

ctgggtcaag cccttcgtac accctaagcc tccgcctcct cttcctccat ccgccccgtc 3780 

tctccccctt gaacctcctc gttcgacccc gcctcgatcc tccctttatc cagccctcac 3840 

tcctcctcta ggcgccggaa ttcggcgcgt gtggcctcga acaccgagcg accctgcagc 3900 

gacccgctta acagcgtaag cttgtcgacg gtatcgataa gcttatggcg cgcccgaccg 3960 

atgtcagcct gggggacgag ctccacttag acggcgagga cgtggcgatg gcgcatgccg 4020 

acgcgctaga cgatttcgat ctggacatgt tgggggacgg ggattccccg ggtccgggat 4080 

ttacccccca cgactccgcc ccctacggcg ctctggatat ggccgacttc gagtttgagc 4140 

agatgtttac cgatgccctt ggaattgacg agtacacgcg tttagatagt ttagtgcgag 4200 

aaaaaagtga agtgaaagcc tacgtcggag gatgtccctc ggtaatcacg gatgctggac 4260 

cgtatgacgc gctcttcgac atgagacgcc gctggtctaa taacggtggc ttcccgctgc 4320 

gaatgcttga agagagctct tcagaagtga catcgtcttc ggcactgggt ttgccaccgg 4380 

ccatggttat gtcgccggaa tccttggcgt cgcccgagta tcgagccctc gagctatgga 4440 

gctacgatga cggaatcact tataatacag cccagtctct gctgggtgca tgcaatatgc 4500 

aacagcaaca gctacaacct cagcaaccac atccagcacc accgacgctc cccacgatgc 4560 

ctttaccaat gcctcccaca acaccgaaat cagaaaatga atcgatgtca tcaggtcgag 4620 

aggaactttc gccggcttca agcacaaatg gctgcagtgc tgatgctgac gccagacggc 4680 

agaagaaagg tcctgcacct cgacagcaag aggagctatg tcttgtctgc ggcgacagag 4740 

cccccggata ccactacaac gcactgacgt gtgaaggatg caaaggattc ttcaggcgga 4800 

gtgtcaccaa aaacgcagta tatatttgta aatttggaca tgcctgtgaa atggatatgt 4860 

acatgaggag gaaatgtcaa gagtgtcgat taaagaaatg tctagcggta ggaatgaggc 4920 

ctgaatgtgt catacaggag cccagtaaaa ataaagacag gcaaagacaa aagaaagaca 4980 

aaggaatatt attacctgtt agtacgacca cagtcgaaga ccacatgccc ccgatcatgc 5040 

aatgtgatcc acctccgccc gaggccgcca ggattcacga agtcgtcccg aggtatcttt 5100 

cggagaagct gatggagcag aacaggcaga agaacatacc accattgtcg gcgaatcaga 5160 

agtctctgat cgcgaggctc gtgtggtacc aggagggata tgagcagccc tccgacgagg 5220 

atctcaaaag agtaacgcag acttggcagt cggatgaaga ggacgaggaa tccgatctac 5280 

ccttccgcca gatcacggag atgacgatct taacggtcca gttgatcgtc gagttcgcca 5340 

agggtctacc gggcttttcg aagatatcac agtctgatca aatcacctta ttaaaagcct 5400 

cgtccagcga ggtgatgacg ctgcgggtgg cgaggcgata cgacgccgcg tccgacagcg 5460 

tgctgttcgc caacaacaag gcgtacacgc gcgacaacta ccgccaaggc ggcatggcct 5520 

acgtcatcga agacctccta cacttctgcc ggtgcatgtt cgcgatgggc atggacaatg 5580 

tgcactttgc actgctcacg gccatcgtta tattctcaga tcggcccggg ctcgagcagc 5640 

cgtcgctggt agaagagatc cagagatact acctgaacac gttgcgaatt tacatcatca 5700 

accagaacag cgcgtcgtcg cgctgcgccg tgatctacgg caggatcctg agcgtgctga 5760 

ccgagctacg cacgctcggc acgcaaaact ccaacatgtg catctcgctg aagctgaaga 5820 

acaggaagct gccgccgttc ctcgaggaga tctgggacgt ggcggaggtg gccacgacgc 5880 

atcccacggt gctgccgccc accaacccgg tggtgctata gcctccgccc gccccaggag 5940 

agaacgctca tagactggct agttttagtg aacgtgcgct gatccgtatt cggtgacaga 6000 

ttagtgatta tatgtgttgt tgaacgtttg gagagtatat atatagtgtt atcgaattcc 6060 

gcccctctcc ctcccccccc cctaacgtta ctggccgaag ccgcttggaa taaggccggt 6120 

gtgcgtttgt ctatatgtta ttttccacca tattgccgtc ttttggcaat gtgagggccc 6180 

ggaaacctgg ccctgtcttc ttgacgagca ttcctagggg tctttcccct ctcgccaaag 6240 

gaatgcaagg tctgttgaat gtcgtgaagg aagcagttcc tctggaagct tcttgaagac 6300 

aaacaacgtc tgtagcgacc ctttgcaggc agcggaaccc cccacctggc gacaggtgcc 6360 

tctgcggcca aaagccacgt gtataagata cacctgcaaa ggcggcacaa ccccagtgcc 6420 

acgttgtgag ttggatagtt gtggaaagag tcaaatggct cccctcaagc gtattcaaca 6480 

aggggcngaa ggatgcccag aaggtacccc attgtatggg atctgatctg gggcctcggt 6540 

gcacatgctt tacgtgtgtt tagtcgaggt taaaaaacgt ctaggccccc cgaaccacgg 6600 

ggacgtggtt ttcctttgaa aaacacgatg ataatatggc cacaaccatg gcatgattga 6660 

acaagatgga ttgcacgcag gttctccggc cgcttgggtg gagaggctat tcggctatga 6720 

ctgggcacaa cagacaatcg gctgctctga tgccgccgtg ttccggctgt cagcgcaggg 6780 

gcgcccggtt ctttttgtca agaccgacct gtccggtgcc ctgaatgaac tgcaggacga 6840 

ggcagcgcgg ctatcgtggc tggccacgac gggcgttcct tgcgcagctg tgctcgacgt 6900 

cgtcactgaa gcgggaaggg acrggctgct attgggcgaa gtgccggggc aggatctcct 6960 

grcacctcac cccgctcccg ccgagaaagt atccatcatg gctgatgcaa tgcggcggct 7020 

gcatacgctt gatccggcta cctgcccact cgaccaccaa gcgaaacatc gcatcgagcg 7080 

agcacgtact cggatggaag ccggtcttgt cgaccaggat gatctggacg aagagcatca 7140 

ggggcccgcg ccagccgaac tgctcgccag gctcaaggcg cgcatgcccg acggcgagga 7200 

ccrcgtcgtg acccacggcg atgcctgctt gccgaatatc atggtggaaa acggccgctt 7260 

ttctggattc atcgactgtg gccggccggg tgcggcggac cgcratcagg acatagcgrt 7320 

ggctacccgt gatattgctg aagagcttgg cggcgaatgg gctgaccgct tccccgtgct 7380 

ttacggnacc gccgcccccg attcgcagcg catcgccttc tatcgccttc ttgacgagtt 7440 

cctctgagcg ggactccggg gttcgaacct gtaggtttgg caagcnagaa caagggttca 7500 

atgcacttgt cgctagaaca agggttcaat gcacttgtcg ctagaacaag ggttcaatgc 7560 

acttgccgcr agaacaaggg ttcaatgcac ttgtcgagaa ccatgggtcg agtaggcgtg 7620 

tacggcggga ggcccatata agcagagctc gtttagtgaa ccgtcagatc gcctggagac 7680 

gccatccaac gctgtttcgc ctccaaaaga caccgggacc gaggccgcct gggcccgttt 7740 

aaacaccgat aaaataaaag attttattta gtctccagaa aaagggggga atgaaagacc 7800 

ccacccgtag gtctggcaag ctagcttaag taacgccatt ttgcaaggca tggaaaaata 7860 

cataactgag aatagagaag ttcagatcaa ggtcaggaac agatggaaca gctgaatatg 7920 

ggccaaacag gatatctgtg gtaagcagtt cctgccccgg ctcagggcca agaacagacg 7980 

gaacagctga atatgggcca aacaggatat ctgtggtaag cagtccctgc cccggctcag 8040 

ggccaagaac agatggtccc cagatgcggt ccagccctca gcagtttcta gaacaagggt 8100 

tcaatgcacr tgtcgctaga acaagggttc aatgcactcg tcgctagaac aagggttcaa 8160 

tgcacttgcc gctagaacaa gggttcaatg cacttgtcga gaaccatcag atgtttccag 8220 

ggtgccccaa ggacctgaaa tgaccctgtg ccttatttga actaaccaat cagttcgctt 8280 

ctcgctrctg cccgcgcgct tctgctcccc gagcccaata aaagagccca caacccctca 8340 

ctcggggcgc cagtccrccg attgactgag tcgcccgggt acccgtactc ccaataaacc 8400 